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A POLYNOMIAL BOUND ON SOLUTIONS OF QUADRATIC 
EQUATIONS IN FREE GROUPS 

IGOR LYSENOK AND ALEXEI MYASNIKOV 

Abstract. We provide polynomial upper bounds on the size of a shortest solution for 
quadratic equations in a free group. A similar bound is given for parametric solutions in 
the description of solutions sets of quadratic equations in a free group. 



1. Introduction 



Let G be a group and X a countable set of variables. An equation in G is an element E 
of the free product G * Fx where Fx is the free group freely generated by X. Usually we 

Ph \ write equations in the classical form E = 1 where ii^ is a word over G U X^^ representing 

a reduced product in G * Fx- Elements of G occurring in E are called coefficients of an 

^ • equation E = 1. A solution oi E = 1 in G is a homomorphism a : G * Fx — > G such 

c^ . that g"" = g for every g E G (so-called G -homomorphism) and -E" = 1. Sometimes it is 

convenient to consider the quotient Ge = G * Fx/{E = 1) of G * Fx modulo the normal 
subgroup generated by E, called the equation group of E = 1. In this case solutions oi E = 1 
in G are precisely G-homomorphisms G * Fx — > G. (Observe that ii E = 1 has a solution 
then G embeds into Ge-) In general, there are two natural problems concerning equations 

(3 I from a given class S- The first one is the famous Diophantine problem: does there exist an 

C"^ ' algorithm to check whether or not a given equation from S has a solution in a group G. The 

second problem is to get an effective description of the set of solutions of an equation in S- 
A word E and an equation E = 1 are termed quadratic if every variable x E X occurring 
in E (as x or x~^) occurs precisely twice. Quadratic equations form a very special class 
among equations of a general form in groups. However, they play an important role in 
several areas of mathematics. This is not very surprising since quadratic equations are 

K^ , naturally related to the topology of compact surfaces (see for example [71 [2l] and Section 

H I |2] below). Quadratic equations groups naturally appear in JSJ-decompositions of groups 

(via QH subgroups) and as group actions of dynamical systems (via interval exchange). 
Moreover, quadratic equations play a fundamental part in general theory of equations in 
groups and algebraic (Diophantine) geometry over groups [21 [13 IISI HH EB]- It has been 
shown by Kharlampovich and Myasnikov [151 [IS] that every system of equations in a free 
group is rationally equivalent to finitely many systems in non-degenerate triangular quasi- 
quadratic form (NTQ), which gives a precise analog of the elimination theory from the 
classical algebraic geometry, with proper extension theorems and nice algorithmic properties 
(for details see surveys [T71 HH]). NTQ systems are crucial in Implicit Function Theorems in 
free groups [H] and Tarski problems [191 [21] • 

The theory of quadratic equations in a free group F goes back to the works of Lyndon [20] 
and Malcev [22] , who described, correspondingly, all solutions of the equation x'^y'^z^ = 1 and 
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[x, y] = [a,b] (here x,y, z & X are variables and a,b & F are the generators of F). Malcev 
anticipated several modern techniques, introducing automorphic equivalence of solutions 
and focusing on minimal solutions in each automorphic orbit. In [30] Wicks gave a decision 
algorithm for the Diophantine problem for equations of the type W = g {g ^ F). He showed 
that the problem of solving such an equation can be effectively reduced to solving finitely 
many particular equations in a free monoid with involution (via "Wicks forms"). 

In [3], Comerford and Edmunds proved that the Diophantine problem for arbitrary qua- 
dratic equations in a free group F is decidable. Later Comerford and Edmunds [6] and 
Grigorchuk and Kurchanov [11] completely described solution sets of quadratic equations in 
free groups (the result will be formulated below in this section). 

In 1982, Makanin [23] proved that the Diophantine problem in free groups is solvable, and 
a few years later Razborov [25] gave a description of solutions of systems of equations in free 
groups. These two very influential papers shaped the modern theory of equations in groups. 
Note that the description of solutions sets of quadratic equations given in [HI [H] is a very 
special case of Razborov's description for general equations. 

Techniques for solving equations in free and related groups were instrumental in solution 
of some other decision problems in group theory: for example, the isomorphism problem in 
hyperbolic groups [271 [9], limit groups [3], and toral relatively hyperbolic groups [8]. 

In view of applications, the principal question concerning equations in groups is the time 
complexity of decision algorithms. It has been shown by Bormotov, Gilman, and Myas- 
nikov [1] that one-variable equations in a free group admit polynomial time decision algo- 
rithms. Ol'shasknii [23] and Grigorchuk and Kurchanov [TT] proved that if the number of 
variables is fixed then the Diophantine problem for quadratic equations in free groups has 
a decision algorithm polynomial in the sum of the lengths of the coefficients. However, this 
is as far as one can go in polynomial time. First, Diekert and Robson showed in [10] that 
the Diophantine problem for quadratic equations in free monoids (semigroups) is NP-hard. 
Then Kharlampovich, Lysenok, Myasnikov and Touikan proved that the Diophantine prob- 
lem for quadratic equations in free groups is precisely NP-complete [13]. Nevertheless, a 
few important questions remained to be open, above all, the question whether decidable 
quadratic equations in free groups have solutions polynomially bounded in the size of the 
equation. 

The affirmative answer to this question is given by the following theorem. Here a quadratic 
word Q is orientable if every variable x occurring in Q occurs in Q with two opposite 
exponents as x and x~^ and non- orientable otherwise, that is, if a variable occurs in Q twice 
with the same exponent -|-1 or —1. 

Theorem 1.1. Let Q be a quadratic word. If the equation Q = 1 is solvable in a free 
group Fa then there exists a solution a such that for any variable x, 

J Nn{Q)c{Q) if Q is orientable, 
~ \Nn{QYc{Q) if Q is non- orientable, 

for some constant N . Here n{Q) denotes the total number of variables in Q and c{Q) the total 
length of coefficients occurring in Q. One can take A^ = 40 for orientable Q and N = 150 
for non- orientable Q. 
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If Q is standard orientable or semi-standard non-orientable (see Definitions \2.2\ and \3.4\ ) 
then there exists a with a better bound 

, „, ] Nc(Q) if Q is orientable, 

\x \ < < 

\Nn{Q)c{Q) if Q is non-orientable, 

with N = 8 and N = 36 respectively. 

In a similar manner, we give a bound on the size of parametric solutions of quadratic 
equations which participate in the description of their solution sets. To state the result we 
need to define a concept of a parametric solution. Let T be an alphabet of parameters which 
is assumed to be disjoint from the alphabet of constants A and alphabet of variables X. A 
parametric solution of an equation E = 1 in F^ is an F^-homomorphism rj : -F4uvar(_B) ~^ 
FjUjt such that E^ = 1 where Var(i^^) denotes the set of variables occurring in E. If rj is 
a parametric solution oi E = 1 then for any F4-homomorphism ip : F^yjT ~^ Fa we get a 
solution rjip oi E = 1 in the usual sense, a specialization of rj. Note that we use here a notion 
of a solution of an equation ii^ = 1 in a free group Fa which is slightly different from one 
introduced above. Instead of taking F4-homomorphisms of Favjx we restrict them to the free 
group FAyjYa.T:{E) involving only variables which occur in E. This provides a more convenient 
way for describing all possible solutions of a given equation E = 1. 

Let Stab(-E') denote the group of all F^-automorphisms </> of FAvjva.r(E) such that E'^ is 
conjugate to E. This group acts on the solution set of F = 1 by left multiplications. Hence 
any parametric solution r] produces a whole bunch of solutions in the usual sense, the union 
of orbits of specializations of 77: 

Sol(r/) = {<Pr]u I (/) G Stab(F), u E RompAFAuT , Fa)} . 

The above mentioned result of Comerford-Edmunds [B] and Grigorchuk-Kurchanov [TT] 
asserts that for any quadratic equation E = 1 in Fa there is (and can be effectively produced) 
a finite set {rji} of basic parametric solutions such that the set of all solution of F = 1 is the 
union Uj So1(?7j). 

Let rj and 6 be two parametric solutions of the same equation E = 1 in Fa- Let us say that 
?7 is a generalization of 6 if there are an automorphism (p G Stab(F) and an endomorphism 
u G Endir^(FAur) such that 9 = (prju. Clearly, in this case we have Sol(?7) ^ Sol(6'). 

Theorem 1.2. Let Q = 1 be a quadratic equation in a free group Fa- Then any parametric 
solution of Q = 1 has a generalization t] such that for any variable x, the length of x^ is 
bounded by the same function as in Theorem \l.l\ with c{Q) replaced by c{Q) + 2n{Q). 

In particular, there is a finite set of basic parametric solutions of Q = 1 satisfying this 
bound. 

Note that a simple description of solution sets (in terms of basic parametric solutions) is 
known for coefficient-free quadratic equations. In this case, only one basic solution is enough. 
Moreover, if an equation is in the standard form then the value of each variable in the basic 
parametric solution is either a parameter letter or trivial, see [121 Section 5]. This basic 
parametric solution is a generalization of every other parametric solution, so the theorem 
does not give much in this case. 

The proof of Theorems 11.11 and 11.21 has three main constituents. In Section [3] we pro- 
duce various automorphisms that allow to transform quadratic words to a desired form. In 
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particular, we re-prove a well known fact (closely related to the classification theorem for 
compact simplicial surfaces, see for example [26| Chapter VI]) that a quadratic word can be 
equivalently transformed to a word belonging to one of the four standard series, see Propo- 
sition 12.11 However, in our proof, we provide an "economical" transformation (Propositions 
\3.3\ \3.5\ \3.7\ 13.91 and 13.101 ) As a corollary we formulate a general result on the bound on 
the complexity of automorphisms which reduce a given quadratic word to the standard form 
over any group G. 

Corollary 13.111 Let Q ^ G * Fx be quadratic word over an arbitrary group G. Then there 
are automorphisms 0, '?/' G AutciG * Fx) such that Q'^ and Q^ are conjugate to the standard 
quadratic words equivalent to Q and for any variable x, 

I (^1 I ^,-1, \An{Q) + 2c{Q) if Q is orientable, 

~ ]8n'^{Q)+An{Q)c{Q) if Q is non-orientable, 

where c{Q) is the total length of coefficients of Q expressed in any left-invariant (e.g. word) 
metric on G. 

Note that we provide here bounds for two different automorphisms: for the direct transfor- 
mation Q ^ R and for the inverse transformation R y Q where R denotes the standard 

form of Q. The reason is that a bound on a automorphism of a free group F does not 
imply a reasonable bound on its inverse 0"^; in particular, see Example 13.81 below. 

As the next step of our argument we develop a version of the elimination process for 
quadratic equations in a free group. To do this, we define a certain set of transformations on 
pairs of the form {Q, a) where Q is a coefficient-free quadratic word and a is an evaluation 
of variables in Q, i.e. an FA-homomorphism -FAuVar{Q) -^ Fa- Then we define a certain 
transformation sequence starting from a given pair {Q, a) whose primary goal is to eliminate 
cancellations in the formal word Q[a] obtained by substituting in Q of the value x" of each 
variable x. Our approach is essentially an improved version of similar approaches used in 
[H m [11] for description of solution sets of quadratic equations in a free group. We start in 
Section m with a lighter version of it, with no care about transformation homomorphisms, 
and prove Proposition 14.61 sufficient for the proof of Theorem 11.11 Our full version of the 
eliminated process is elaborated in Section E] for the proof of Theorem 11.21 

Note that there is an alternating (and essentially equivalent) approach to solutions of 
quadratic equations in free groups using Lyndon-van Kampen diagrams on surfaces, see for 
example [2l] . Though this approach provides a clear geometric vision, an advantage of using 
elimination process is an easier control of transformation homomorphisms. 

The final ingredient to the proof is a statement about Lyndon-van Kampen diagrams 
(Proposition 15. 4p which says that a diagram can be unfolded in an economical way. 



2. Quadratic words and surfaces 

In this section, we recall some elementary concepts and formulate some known facts about 
quadratic words. The free group Fa and a countable set X of variables will be fixed through- 
out the whole paper. The following fact is well known (see for example [121 Section 4]). 



Proposition 2.1. By an Fa- automorphism of Faux, every quadratic word can be reduced 
to one of the following forms: 

[xi,yi][x2,y2]---[xg,yg] (fl- > 0), 

xlxl...xl ig>0), 



[Xi, yi] [X2, ^2] • • • [Xg, yg] C1Z2 ^C2Z2 ■ ■ ■ Z^CmZn 



(^ > 0, m > 1^ 

.g ^,~^ ^.~. „ ^,n~,n {9>0, m > 1). 

Here Xi, yi and Zi are variables and Ci G Fa, Ci 7^ 1, are coefficients. 



22 2 —1 —1 

X^X2 ■ ■ ■ X C1Z2 C2Z2 ■ ■ ■ Z^^ CjYiZm 



Definition 2.2. A quadratic word Q and an equation Q 
to one of the four series given in Proposition 12. 1[ 



1 are called standard if Q belongs 



The proof of the proposition essentially repeats the proof of the classification theorem for 
compact surfaces, see for example a classical topology textbook [26l Sections 38-40] . In the 
next section we refine the proof and formulate a series of propositions that the reduction 
automorphism can be chosen economically in terms of its size. 

The series of quadratic words in Proposition 12.11 represent the four series of compact 
surfaces: closed orientable surfaces of genus g, closed non-orientable surfaces of genus g, 
orientable surfaces of genus g with m boundary components and non-orientable surfaces of 
genus g with m boundary components, respectively. 

The relation between quadratic words and surfaces relies upon the following simple ob- 
servation. Let Q be a quadratic word. Take a 2-disk D with boundary dD divided into \Q\ 
arcs. Choose an orientation of the boundary dD and label the arcs with letters of Q in 
the order they appear in Q (instead of x~^ we put x and direct the arc in the opposite 
to the orientation of dD). Then for each variable x occurring in Q, glue together the two 
arcs labelled by the two occurrences of x in Q, according to their orientation. We get a 2- 
complex Sq representing a compact surface. The surface is closed if Q is coefficient-free and 
has a boundary otherwise. (See Fig. [1] where Q = x~^ay~^hxcyd and Sq is homeomorphic 
to a torus with a disk removed.) 

X 




y 




D 



y 





X 




Figure 1. 

Directed edges of 2-complex Sq carry in a natural way labels in A^^UX^^. The edges in the 
interior of the surface are labelled by variables in Var((5) and edges in the surface boundary 
labelled by letters in A^^ which come from coefficients oi Q. In case of orientable Sq we fix 
a positive direction of passing along each boundary component which agrees with the chosen 
orientation of the boundary of disk D. In this case, we view the label of the boundary 
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component as an element of Fa defined up to conjugacy. If Sq is non-orientable then labels 
of boundary components are viewed as elements of Fa defined up to conjugacy and taking 
the inverse. 

The topological type of Sq may be different from the topological type of Sr where R 
is a standard form of Q from Proposition 12.11 It is not hard to see that if the label c of 
a boundary component of Sq represents the trivial element of Fa then the corresponding 
boundary component disappears in Sr. We introduce an extra reduction step for Sq: if c is 
trivial, attach a 2-disk along the corresponding boundary component of Sq. We denote Sq 
the resulting surface. 

Definition 2.3. Let Q be a quadratic word. We call the unordered tuple of all nontrivial 
labels of boundary components of Sq, viewed as elements of F4, the standard coefficients 
of Q. As remarked above, the standard coefficients are defined up to conjugacy if Sq is 
orientable and up to conjugacy and taking inverses if Sq is non-orientable. 

Definition 2.4. Two quadratic words Q and R are equivalent if Sq and Sr have the same 
topological type and their tuples (cn, . . . , Cmi) and (ci2, . . . , 0^2) of standard coefficients are 
the same up the the natural equivalence: if Sq is orientable then up to enumeration, each cn 
is conjugate to the corresponding Cj2; if Sq is non-orientable then up to enumeration, each cn 
is conjugate to either Cj2 or c^^. 

With a slight abuse of the language, we call a quadratic word Q itself orientable if Sq is 
an orientable surface and non- orientable otherwise. It is immediate from the construction 
of Sq that Q is orientable if and only if each variable in Q has two occurrences with different 
exponents —1 and -|-1. By the genus of Q we mean the genus of Sq. 

Thus, any quadratic word Q is defined up to equivalence by the following parameters: ori- 
entability (true/false), the genus g and the tuple of standard coefficients (ci, . . . , Cm)- Propo- 
sition 12.11 asserts precisely that any quadratic word can be reduced by an FA-automorphism 
of Fa\jx to an equivalent standard quadratic word. 

The following fact may be attributed to folklore. (We don't make use of it in the paper 
and provide only a sketch of the proof.) 

Proposition 2.5. Let Q,R E Faux be two quadratic words. Then the following statements 
are equivalent: 

(i) R is equivalent to Q; 

(ii) R is the image of Q under an Fa- automorphism of Faux! 
(iii) R is conjugate to the image of Q under an Fa- automorphism of Faux- 

Sketch of the proof: To prove implication (i)^(ii), we reduce two given equivalent quadratic 
words to equivalent standard forms by Proposition 12.11 It is then an easy exercise to 
show that two equivalent standard quadratic words are images of each other under an Fa- 
automorphisms oi Faux (one can use also automorphisms from Lemmas l3. 13l and l3. 14l below) . 
Implication (iii)=^(i) follows easily from the following statement: li Q,R G Faux are 
quadratic words and R is conjugate to Q^ for some G AutpAi^Aux) then may be 
represented as a product TiT2...Tk of elementary Nielsen F4-automorphisms Tj such that 
the cyclically reduced form of Q^^'^* is a quadratic word for each i. To prove the statement, 
we use the Higgins-Lyndon approach to stabilizers in Aut(F) as exposed in Lyndon and 
Schupp's book pi] Section 1.4]. We apply a modified version of Proposition 1.4.23 of [21] to 
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one cyclic word ui = Q and the tuple of non-cyclic words Mj, i > 2, consisting of all letters 
a ^ A. By this proposition, can be represented as a product 

= P1P2 ■ ■ -Pr 

of Whitehead automorphisms pi such that for some p and q, 0<p<q<r — 1, 

\Qi+i\ < \Qi\ for i = 0,...,p, \Qi+i\ = \Qi\ ioT i = p+l,...,q 

and 

\Qi+i\ > \Qi\ ioT i = q + l,. . .,r -1 
where, by definition, Qo = Q and Qi is the cyclically reduced form of Qp^P'^--P\ Then it is 
not hard to see that in each of the chains 

Qo —^ Qi —^ ■ ■ ■ —^ Qq and Qr > Qt~\ > ■ ■ ■ > Qq 

every automorphism pf^ can be factored into a sequence of elementary Nielsen automor- 
phisms which keep the property of a word being quadratic. D 

3. Transforming quadratic words 

In this section, we describe several specific transformations of quadratic words to produce 
equivalent quadratic words of a desired form. By a transformation we mean application of 
an F^-automorphism of Faux (or an F(7-automorphism of Fcux where Fc is the group of 
formal coefficients, see below). 

By |iy I we denote the length of an element of a free group written as a frely reduced word. 
The notation \W\x will be used for the total number of occurrences of a letter x in W. More 
generally, is S* is a set of letters then \W\s will denote the total number of occurrences in W 
of letters from 5*. Sometimes we consider formal (not necessarily freely reduced) words W. 
In this case, \W\ denotes the length of W. 

For automorphisms of a free group Fy, we use a notation 

(j) = (xf H-)- 14^1, X2' ^ W2, ... xl'' h^ Wk) where Xi eY, Ei = ±1, Wi G Fy 

which means that maps generator x^' to Wi and does not change other generators. 

We describe several types of elementary transformations applied to a quadratic word Q G 
Faux and producing an equivalent quadratic word Q'f', or, in a weaker form, an equivalent 
quadratic word conjugate to the image Q'^ oi Q. 

Permutations and exponent sign changes of variables: Permutations on the set of variables 
and automorphisms of the form [x i— )■ x~^). These automorphisms always carry quadratic 
words into equivalent ones. We will implicitly assume that automorphisms of this type are 
applied if needed. For example, to transform a word Qi to a given word Q2 it is enough 
to find an automorphism G Antp^iFAux) such that Qf is equal to Q2 up to renaming 
variables and changing their exponent signs. 

Multiplications by coefficients: Automorphisms of the form {x^ 1— >■ x^d), d G Fa- Any 
automorphism of this form carries any quadratic word to an equivalent one. We will use 
only the special case when x^d~^ or dx~'^ occurs in Q as a subword. We refer to these 
transformations as coefficient shifts. Geometrically, we shift the start of the edge labelled x^ 
along the boundary arc of Sq labelled d~^ from the start of this arc to its end (Fig. [2]). This 
automorphism does not change Sq unless the corresponding boundary component is labelled 
by the trivial element and there are two occurrences of (x^(i~^)^^ in Q (Fig. E]). In this case, 
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this boundary component disappears (but the topological type of the reduced surface Sq is 
not changed). 





Figure 2. 




X 




Figure 3. 

Introductions of new variables: Nielsen automorphisms of the form (x^ i— ?• x'^y^) where a 
variable y does not occur in Q. 

Related Nielsen automorphisms: Nielsen automorphisms (x^ \-^ x^y^) related to Q, that 
is, those for which {x^y~^)'^^ occurs in Q. Geometrically, we shift the start of the edge 
labelled x"^ along the edge labelled y^, see Fig. |H If (x^y"'')^^ occurs in Q^^ twice then y 
is eliminated from Q. If we view Q as a. cyclic word then for any Nielsen automorphism p 
related to Q, the image Q^ is conjugate to a quadratic word equivalent to Q. 




or 



A 




X 



Figure 4. 



It is sometimes convenient to view transformations of quadratic words from a slightly 
different angle — through formal coefficients. This means that we introduce a new alphabet C 
of formal coefficients and consider quadratic words in Q G Fcux with the property that each 
coefficient letter c & C occurs in Q at most once. To get a quadratic word in the usual sense 
(as an element of Faux) we have to provide it with a coefficient map Fc -^ Fa- 

Thus we have two ways of representing quadratic words: as an element of Fa\jx and as a 
pair (Q,7) where Q is a quadratic word with formal coefficients and 7 is a coefficient map. 
An advantage of the second way is that quadratic words can be viewed independently on 
the coefficient group which may be not necessarily free. We use this representation since it 
provides a more convenient accounting of lengths of coefficients occurring in transformations. 
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Note that all elementary transformations introduced above are applicable also to quadratic 
words with formal coefficients (just with Fa replaced by the formal coefficient group Fc). 
If a pair (Q, 7) represents a quadratic word R G Fa\jx then transformations of Q induce 
corresponding transformations of R. 

In the rest of the section we assume that quadratic words are ones with formal coefficients 
and belong to the group Fcyjx- 

We prove a series of statements asserting that a quadratic word can be reduced to a 
standard form using an automorphism of bounded complexity. Note that the notion of 
a standard quadratic word is essentially not changed when passing to words with formal 
coefficients. It is required only that any formal coefficient occurs in the coefficients q of a 
standard quadratic word at most once in total. 

Definition 3.1. We call automorphisms of the form [x^ — )■ Wx'^) where x does not occur 

in W , elementary. 

We say that an elementary automorphism (x^ t—)- Wx^) changes x and touches variables 
occurring in W . 

A product P1P2 ■ ■ ■ Pk of elementary automorphisms pi is triangular if it satisfies the fol- 
lowing condition: as soon as pi touches x, the variable x is not changed by all subsequent 
automorphisms Pi+i, . . . , Pk- 

The following observation is immediate: 

Lemma 3.2. Let 

= (xf ^ Wix\^){xl^ -^ W2XI') . . . {xl^ -^ Wkxl^) 

he a triangular product of elementary automorphisms of a free group F . For a generator x, 
let Wi^, Wi^, ..., Wi^ be all words Wi which participate in automorphisms (xj — ;■ WiXi) 
with Xi = X and Si = 1, and Wj^, Wj^, ..., Wj^ be all words Wi which participate in 
automorphisms {x~^ — ?• WiX^^) with Xi = x and ei = —1. Then 

x^ = W,,W,, . . . W,^xWr^Wr\ . . . Wj^K 

In particular, for any generator y ^ x, the number of occurrences of y in x'^ does not exceed 
the total number of occurrences of y in all Wi^. and Wj,. . 

Proposition 3.3. Let Q G Fcux be an orientable quadratic word. Then there exists an 
automorphism G Antpf^^FcuVariQ)) such that Q^ is conjugate to a standard quadratic word 
equivalent to Q and for any x,y E Var(Q) and c E C , we have \x'^\y < 4 and \x'^\c < 2. 
In particular, \x'^\ < 2\Q\ for any x G Var((5), 

Proof. We construct the required automorphism as a triangular product of elementary 
automorphisms. Starting from this point throughout the proof, we denote by Q the current 
quadratic word after application of a sequence of elementary automorphisms constructed so 
far. At the start, Q is any quadratic word from the hypothesis of the proposition. We view Q 
as a cyclic word and thus regard transformations up to conjugation. 

We assume that Q has at least one variable (otherwise the proposition is trivial). 

At any moment, there are locked variables in Q which have been touched by previous 
elementary automorphisms. They should not be changed by subsequent elementary auto- 
morphisms. 
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By Tq we denote the 1-skeleton of Sq, i.e. the graph embedded in the surface after the 
identification of arcs in the boundary of disk D as described in Section [2l 

Step 1: Ehminating boundary superfluous vertices. 

If Sq is a closed surface then Step 1 is void and we jump to Step 2. Recall that Sq is 
closed if and only if Q is coefficient-free. 

We call a vertex u in the boundary of Sq essential if it is an endpoint of an interior edge 
of Sq (i.e. an endpoint of an edge labelled by a variable). 

Let £ be a boundary component of Sq. Observe that i has at least one essential vertex 
(since Q has at least one variable). We describe a sequence of elementary automorphisns 
which result in exactly one essential vertex in £. Let ui, . . . , Uk be all cyclically ordered 
essential vertices in i. Let c G Fa be the label of the arc between z/i and 1^2 and xi, . . . ,Xr G 
X"^^ be labels of edges starting at z/2 so that cxi, a;]"^a;2, . . . , x~\xr occur in Q (see Fig. |5]). 
We apply to Q a sequence of elementary automorphisms 

{Xi H-). C~^Xi){x2 ^-^ C~^ 



X2 



tXj <r* ' ( (_■ tAJ f 



eliminating essential vertex U2. Then we proceed in the same way eliminating all other 
essential vertices i^3, . . . , z/^. 





Figure 5. 



We repeat the procedure for all other boundary components of Sq. After that, each 
boundary component will have exactly one essential vertex. Their labels are the standard 
coefficients of Q. There are no locked variables after this step. 

Step 2: Eliminating inner superfluous vertices. 

We choose a base vertex z/i of Tq as follows. If Q is coefficient-free we take any vertex 
of Tq. li Q has a coefficient then for the base vertex we take any essential vertex in the 
boundary of Sq. 

Suppose that Tq has a vertex i/ 7^ z/i in the interior of Sq. Let ei, . . . , e^ be all directed 
edges starting at u, labelled by variables Xi, . . . ,Xk € X"^^. Since u and z/i are connected by 
a path in Tq, at least one of Cj, say Ci, ends in a vertex u' distinct from u. Then we apply a 
sequence of elementary automorphisms 

ij = {X2^ X^^X2) . . . (Xfc H- X^^Xk) 

eliminating u. Observe that xi does not occur in the new word Q^, so the locked variable xi 
will not participate in the subsequent automorphisms. 

We perform elimination of all vertices z/ ^ z^i in the interior of Sq. After that, if Q is 
coefficient-free then Tq has only one vertex Ui. li Q has a coefficient then Tq has m vertices 
ui, . . . , z/m, one in each boundary component of Sq. 

Step 3: Collecting coefficient factors. 

We assume here that Q has at least one coefficient. If Q is coefficient-free, we jump to 
Step 4. 
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The step consists of a sequence of substeps 3i, 82, . . • , 3^-1- Before step 3^, Q has the 
form 

Q = CiZ2^C2Z2Z^^C3Z3 . . . zr^dZiW 

where Zj G X^^ and Cj £ F^ and W has no locked variables. Recall that Q is viewed as a 
cyclic word, so at the start of step 3 we have Q = CiW for some W and coefficient Ci. 

If no coefficients occur in W then we stop. Suppose that a coefficient occurs in W. 
Since Tq is connected, there is an edge in Tq starting at ui and ending in a boundary 
component of Sq distinct from ones labelled by Ci, C2, . . . , Q. Let Zi+i G X"^^ be such an 
edge (for convenience we identify edges with their labels) and z/j+i its endpoint in a boundary 
component labelled q+i. 

Step 3j consists of the following. Using related Nielsen automorphisms and coefficient 
shifts, we first shift starting vertices of all interior edges of Sq at i^j+i other than z~^-^ to 
a new position at ui along the path labelled z~^-f^ or the path labelled Ci+iz~^^ (see Fig. [H]). 
Next, if there are edges starting at z/i between Zi and Zi-^i, we shift them one by one along 
the path labelled 2;j+iCj+i2;j^\. 




Figure 6. 



After that, Q gets the form 



Q — '^\'^2 C2Z2Z3 C3Z3 . . . Z-_^-^Ci^iZi^iW 



and we iterate the procedure. Finally we come to a word of the form 

Q 



C1Z2 C2Z2 ■ ■ ■ Z^ Cm.Zm.R 



m '-■m^m-' 



where i? is a coefficient-free quadratic word with no locked variables. If R is empty we get 
the desired standard quadratic word. Otherwise we proceed to the next step 4. 

Observe that all edges of Tq labelled by variables occuring in R start and end at the same 
vertex z/i. In this case, we define a star word R* as the sequence of labels of edges labelled by 
variables in R when moving around ui in a small neighborhood of z/i. To fix the direction of 
the motion we agree that if x'^y^ occurs in R* then y~^x'^ occurs in R. If Q has a coefficient 
then R* is the word read off between the edges of Tq labelled the starting letter of ci and Zm 
(see Fig. [7]). If Q is coefficient-free then R = Q and we view R* as a cyclic word. 

Observe that R* is an orientable quadratic word of the same length and with the same 
variables as R. 

Step 4: Collecting commutators. We assume that Q has a coefficient; the coefficient-free 
case is similar with obvious minor changes. 
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Figure 7. 



At any stage of this step, we do not change the coefficient part of Q aheady collected, so 
Q has the above form 

Q = Ci22 C2Z2 ■ ■ ■ Z^ CmZmR 

where i? is a coefficient-free quadratic word. We will apply only related Nielsen automor- 
phisms involving variables of R. This implies that all edges of Tq labelled by variables in R 
start and end at the same vertex Ui. 

The whole step 4 is again an iterative sequence of smaller steps 4i, . . . , 4^. Before sub- 
step 4j, we have 



and 



R=[xi,yi]... [xi-i,yi-i]T 



R* = T*[y,^,,x-_\]...[y^,x^'] 



where T and T* are orientable quadratic words with Var(T) = Var(T*); in particular, \T\ = 

IT^* I 

We stop if T and T* are empty. 

Assume that T and T* are nonempty. Let Xj be a variable occurring in T, so 

T* = UxiVx-^W 

up to the exponent sign of Xj. 

We claim that at least one variable yt occurs in V exactly once. Indeed, if l^ is a (possibly 
empty) quadratic word then a quadratic word V lies between the two occurrences of Xi in T, 
that is, T = . . . Xj ^Vxf^ .... In this case the edge of Tq labelled Xi would wave distinct 
endpoints, a contradiction. 

Hence, up to interchanging Xi and yt and changing their exponent signs we have 



T* = Ziy- ^Z2XiZ^yiZiXi ^Z^ 
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The following sequence of automorphisms collects the commutator [yj, x^ ^] when applied 
toT*: 

T y Ziy. Z2Z^XiZ^yiZiXi 

> Ziy. XiZ3Z2ZrjyiZiXi 

{x.^XiiZzZ^Z^)-^) ry -1 ry ry ry ry -1 

> Ziy- XiyiZiZ:iZ2Zr,x^ 

{y,^y,(ZiZzZ-aZ^)-^) _^ _1 

y Z1Z4Z3Z2Z51/. XiyiX^ 

It is straightforward to check that for any Nielsen automorphism p related to T* there is 
a dual Nielsen automorphism p* related to T whose action on T agrees with the action of p 
on T*, that is, {T*)p = {Tp')*: If x^'y^ occurs in T* then we define 

(x" (^ xV^)* = (/ ^ ajV) and (/ ^ x'^'')* = (x" h-> yV). 

We observe that if p changes x and touches y, then the role of these variables in p* is 
interchanged. This implies that there is a sequence if) of related Nielsen automorphisms 
touching only Xj and yi which transforms T to a word T^ where 

{T^r = Z[y,,xT\ 

Then for some T', 

T'^ = [x,,|/,]T' 

as required. This finishes step 4j. 

After completion of this procedure Q gets the standard form. We get an automorphism 
reducing to such a form an arbitrary quadratic word from the hypothesis of the proposition. 

Calculation of the hounds for (p. Fix a variable x G \ax{Q). Let Wi, W2, ■ ■ ■ Wk be the 
list of all words W in elementary automorphisms (x^ — )■ Wx^) which are the factors of </>. 
We claim that any variable y ^ x occurs in all Wi in total at most 4 times. Indeed, at each 
individual step 2, 3 or 4, y occurs at most twice in the automorphisms of the form (x — )■ Wx) 
and at most twice in the automorphisms of the form (x~^ — )■ Wx~^). It remains to observe 
that if y participates (i.e. occurs in some W) at step 2 or 3 then y does not participate in 
subsequent steps. By Lemma 13.21 we get 

\x%<A. 

From the construction it is easy see also that each constant c E C occurs in Ix**^! at most 
twice. This implies 

\xX<2. 

This finishes the proof of Proposition 13.31 D 

Definition 3.4. We call a non-orientable quadratic word Q semi- standard if Q has one of 
the following forms 

Q = X1X2 ■ ■ ■ X;j[Xfc+i, yk+i\ ■ ■ ■ [Xn, Unl 

or 

where factors [xj,|/j] are not obligatory. 
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Proposition 3.5. Let Q G Fcux be a non-orientable quadratic word. Then there exists an 
automorphism G AntFciF^cuYariQ)) such that Q"^ is conjugate to a semi-standard quadratic 
word equivalent to Q and for any x,y E VaT{Q) and c E C , we have \x'^\y < 4 and \x'^\c < 2. 
In particular, \x'^\ < 2\Q\ for any x G Var((5). 

Proof. The steps 1-3 are the same as in the case of orientable Sq. After performing these 
three steps we get 

Q = C1Z2 C2Z2 • • • ^m C^m^mR 

where R is now a non-orientable coefficient-free quadratic word. The rest of reduction 
consists of the following step. 

Step 4°: Collecting squares and commutators. 

As in the orientable case, we will work with the star word R* . Unfortunately, the definition 
of R* given in the proof of Proposition 13. 31 suits only for orientable R since it always produces 
an orientable R* . We modify the definition. 

For any edge e of S'q, we fix its orientation which is a choice of the positive direction 
of crossing e inside Sq. When passing around the base vertex z/i, we read the label x of e 
with exponent +1 if we cross e in the positive direction and —1 otherwise (so the exponent 
signs of variables in R* are not related directly to the exponent signs of the corresponding 
occurrences in R). It is easy to check that application of a related Nielsen automorphism 
p = {x'^ I— >■ y^x'^) to R* agrees with application of a dual Nielsen automorphism p* related 
to R that changes y and touches x. 

Let e be an edge of Tq labelled by a variable x G Var(i?). If e reverses the orientation when 
viewed as a loop in Sq then we cross e twice in the same direction when passing around Ui. 
Hence the both occurrences of x in R* have the same exponent sign. Since there is at least 
one orientation-reversing e, i?* is a non-orientable quadratic word. Thus, up to the exponent 
sign of X, R* has the form 

R* = Z1XZ2XZ3. 

We then collect the square of x: 

R* ^ ZixZ^^Z2X '—^ ZiZ^^Z-sx^ 

If ZiZ2^Z3 is non-orientable then we repeat the procedure collecting the square of a variable 
to the right. Otherwise we apply the procedure of collecting commutators described in Step 4 
in the orientable case. Using a triangular sequence of elementary automorphisms we finally 
reduce Q to the form 

Q = CoZ^^CiZi . . . Z~^_^Cm-lZm-lxl . . . Xfc[Xfc+i, yk+l] ■ ■ ■ [Xr, 1/r] 

The bound on is obtained in the same way as in the orientable case. D 

To reduce a semi-standard non-orientable quadratic word to a standard form we need an 
extra transformation. 

Lemma 3.6. There are automorphisms rji^jOk G Aut(F{a;Q^ ,2.^.,^^,...,^^.}) such that 

2\ 1 r 1 ^fe'^fe> 222 22 

Xo[xi, yij . . . [xfc, yk\ > a;oXi?/i . . . x^^j^, 

and for any x G {xq, . . . , Xfc, t/i, . . . , yk), 

|x'"= I < 4A; + 1 and \x^^^ | < 3A: + 2. 
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Proof. Let ■y{x,y, z) be an automorphism of F{x,y,z} such that 

{x^[y,z\y = x^y'^z^. 

Then taking 

Vk = l{xo, xi, yih{yuX2, ^2) • • • liVk-i, Xk, yu) 
we obviously get 



2r 1 r 1 ''fe, 2 2 2 22 

Xo\xi,yi\ . . . [xfc, yk\ — > x^x^y^ . . . x^yk 



For a specific 7, we take 



7 = (x I— i- x^yzx "^, y I— !■ x|/2;a; ""^j 2; 1— >■ xz). 



The bound ||?7jt|| < 4/c + 1 is straightforward. 

To define dk we proceed in a similar way by taking for 7 another automorphism 



with 



7 = (x t-> x?/2, y ^^ z ^y ^x ^yzxyz, z ^-^ z ^y ^x ^z) 



7 "^ = (x I— )■ x^y ""^x '^, y i-^ xyx ^z ^x ^, z \-^ xz) 



D 



Proposition 3.7. Lei Q G -Fcux ^e a non-orientable quadratic word. Then there exists an 
automorphism (p G Auti7'(^(FcuVar(Q)) such that Q"^ is conjugate to a standard quadratic word 
equivalent to Q and for any x G Yax{Q) we have \x'^\x < 8n((5) genus (Q) and \x^\c < 2 for 
any formal coefficient c E C . 

Proof. Proposition 13.51 and Lemma IXBl produce an automorphism G AutpciF'cuVa.riQ)) such 
that Q"^ is conjugate to a standard quadratic word equivalent to Q and 

Ix'^lx <4n(g)(4A; + l) and \x% < 2 

where k is the number of commutators in the semi-standard form given by Proposition 13.51 
It remains to notice that 

k < -(genus(Q) - 1). 

D 

We turn now to bounds similar to Propositions 13. 3^ 13.51 and 13.71 where we estimate the 
size of the automorphism inverse to </>. Note that the sizes of an automorphism </> of a free 
group F and of its inverse (f)~^ can be very different. We give an example where the ratio is 
exponential in the rank of F. 

Example 3.8. Let F = F^xi,...,x„}- Define an automorphism G Aut(F) by 

_ 

■'^21+1 ~ Xi^iXiXi^2Xi~l ■ ■ ■ X2iXiX2i+l, 

4> _ 

X2J — XjXj_|_iXj_iXj_|_2 . . . XiX24 

Then \\(j)\\ = n where, by definition, ||0|| = maxj |xf |. It is not hard to see that ||0~^|| = 2". 

Proposition 3.9. Let Q G Fqux be any quadratic word. Then there exists an automorphism 
ip G Autp^{FcuYa.r{Q)) such that Q^ is conjugate to a quadratic word R equivalent to Q and 
the following assertions are true: 

(i) R is standard if Q is orientable and semi-standard if Q is non-orientable. 
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(ii) For any x,y E Var((5) and c G C , we have jx'^ Ij/ ^ 4 and \x^ |c < 2. In particular, 

Proof. Similarly to the arguments used in the proof of Propositions 13.31 and I3.5[ we view Q 
as a cyclic word and construct ip so that the inverse automorphism ip~'^ will be a triangular 
product of elementary automorphisms. The condition that ijj~^ is triangular is equivalent to 
the condition that ip is reverse triangular in the following sense: if a variable x is changed 
by some elementary automorphism in the product then x is not touched by subsequent ele- 
mentary automorphisms. If fact, the proof will be simpler than the proof of Propositions 13.31 
and l3.5l because there is no need to pass to the star word and we will operate on the quadratic 
word Q itself. 

According to the change in the notion of a triangular product, we change the notion of a 
locked variable: a variable x is viewed as locked at a current transformation step if it was 
previously changed by an elementary automorphism. Subsequent elementary automorphisms 
should not touch locked variables. Similar to the proof of Propositions 13. 31 and 13. 5^ we collect 
locked variables to a fixed part of Q. Thus, at any moment Q has the form LR where L is 
the locked part of Q and R has no locked variables. At the start, L is empty and since Q is 
viewed a cyclic word, we assume without loss of generality that R starts with a coefficient 
letter. 

We proceed in two steps. 

Step V: Reducing the coefficient-free part. 

If R is non-orient able then R = Z1XZ2XZ3 for some variable x. Using a sequence of 
elementary automorphisms similar to one given in Step 4^^ in the proof of Proposition 13.51 we 
collect x'^ to the left of R and add it to the locked part. 

If R is orientable and there are variables x and y which "cross" in R, that is, R = 
Zix~^ Z2y~^ Z^xZ^y Z^ (up to exponent signs of x and y) then we collect the commutator 
[x, y] to the left of R as in Step 4 in the proof of Proposition 13.31 

Iterating the procedure, we reduce Q to the form 

Q = xf... xl[xk+i, yk+i] ■■■[xr, yr]R 

where R is orientable and has no crossing variables. 

Step 2': Collecting the coefficient part. 

If R has no variables then we are done. Let Var(i?) 7^ 0. By the assumption that R has 
no crossing variables, there is a variable z with no variables between the two occurrences of 
z in R. Then we have R = Tiz~^czT2 where c G -F4 is a standard coefficient of Q. Applying 
automorphism [z 1— ?■ zT2^) to Q we shift z~^cz to the right of R. Iterating the procedure we 
get 

Q = Xi . . . X^[Xfc+i, yk+l\ ■ ■ ■ [Xr, yr\ClZ2 C2Z2 ■ ■ ■ Z^ CmZm 

It remains to observe that at any transformation step, we preserve the property that R starts 
with a coefficient letter and so Ci 7^ 1. 

The required bounds {x"^ !»/ ^ 4 and {x"^ |c < 2 are straighforward in view of Lemma [3l2] 
(applied to ip'^). □ 

From Proposition 13.91 and Lemma 13.141 we get a dual version of Proposition 13.71 

Proposition 3.10. Let Q G Fqux be a non- orientable quadratic word. Then there exists 
an automorphism ip G Antpf^^FcuYariQ)) such that Q^ is conjugate to a standard quadratic 
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word equivalent to Q and for any x,y E Var(Q) and c E C, we have \x'^ \y < 8genus(Q) 
and |x^ \c < 4genus((5). 

Passing from quadratic words with formal coefficients to quadratic words Q E G* Fx over 
a group G we can easily formulate a general result for an arbitrary coefficient group G. 

Corollary 3.11. Let Q E G * Fx be quadratic word over an arbitrary group G. Then there 
are automorphisms (j),ip E AutG'(G * Fx) such that Q'^ and Q^ are conjugate to standard 
quadratic words equivalent to Q and for any variable x, 

I 01 I ^-1, \An{Q) + 2c{Q) if Q is orientable 

~ y'Sn^{Q) + An{Q)c{Q) if Q is non- orientable 

where c{Q) is the total length of coefficients of Q expressed in any left-invariant (e.g. word) 
metric on G. 

Proof. For orientable Q this follows from Propositions 13.31 and 13.91 If Q is non-orientable 
then we have to use Propositions 13.71 and 13.101 and inequality genus((5) < n{Q). D 

In the end of the section, we formulate several lemmas for later use. The first one was in 
fact proved in the proof of Proposition 13.91 (unlike the proposition, we do not make a passage 
to a conjugate element here). 

Lemma 3.12. (i) Let Q G Fx be a coefficient-free orientable quadratic word. Then 

there is an automorphism ip G Aut(Fx) such that 

Q'f' = [xi,yi]... [xg,yg] 

and \xf \x, lyf |x < 4 for all i and x G Var((5). 
(ii) Let Q G Fx be a coefficient-free non-orientable quadratic word. Then there is an 
automorphism tp G Aut(Fx) such that 

Q = Xi . . . Xj.[Xk-\-l, yk+l\ ■ ■ ■ [Xk+n, yk+n\ 

and \xf \x, \yf \x < 4 for all i and x G Var((5). 

Lemma 3.13. Let m > 1 and a be a permutation on {1, . . . ,m}. Then there is an auto- 
morphism ip o/-F{ci,...,c™,2i,...,z,„} such that of = Ci for all i, with the following properties: 
(i) 

—1 —1 —1 '/' —1 —1 —1 

(ii) For any i, the image zf of Zi has the form 

jP _ -I _i 

Zi — ZiZ^f^-^^Ca{j^)Za{j^) ■ ■ ■ Z.^(^j^^^Ca{j^)Zrj{jf.) 

for some increasing sequence of indices 1 < ji < ■ ■ ■ < jk ^ f^- 
Proof. Using the automorphism 

Zi ^iZiZj ^j Zj ^ Zj CjZjZj^ CiZi 

we can permute two neighboring factors of the form z'-^CiZi. Starting with z~}.Co-{m)Za{m) 
we arrange factors z~'^CiZi to the right in the order as they should occur in the desired image 
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of z-^ ^ciZi . . . z^CmZm- We get a triangular product of elementary automorphisms and all zf 
have the required form as stated in (ii). D 

Lemma 3.14. Let 

Ql = X-^^ . . . X^[X/fc_|_i, Xk+2\ ■ ■ ■ Fg-1, Xg\Zi C1Z1Z2 C2Z2 • • • ^m ^m^m 

and 

he two non-orientahle quadratic words of the same genus g with the same set of coefficient 
letters {ci, . . . , Cm}- Then there is an automorphism ip o/F{ci,...,c™}uVar(Qi) such that of = Ci 
for all i, with the following properties: 

(i) Qi is conjugate to Q2- 
(ii) For any variable y G {xi, . . . ,Xg,Zi, . . . , Zm}, 

\y^\Y<4:g, \y^\z<^m and \y^\^^ <2, i = I, . . . ,m 

where Y = {xi, . . . ,Xg} and Z = {zi, . . . , Zm}- 

Proof. We use the following set of automorphisms: 

01 = (a; t-T- y'^zxy^'^, y t-)- yzxy^^, z i-)- yx), 

02 = (a;H-> [y,z]x[y,z]~'^), 



x'^[y,z 


% 


2 2 2 

y z X , 


x^[y,z 


4>2 


y,z]x'^, 


2 2 2 

X y z 


4>z 


y,z\x^, 


x^z~^cz 


4i\ 


z-'c-^zx" 



^3 



[x t-> [y, z]xy, y *-> y ^x ^z ^ , z v-^ zx) 



04 = (x H- 2; ^c ^zx, z \-^ zx), 

2 -1 <^5> -1 2 X / . 2\ 

X z cz — y z czx , 05 = (z I— )■ zx j. 

We pick up Xg with q = min{/c, r} and move it to the right of Qi so that the square/commutator 
part becomes the same as in Q2 and each constant q gets the exponent sign Si as in Q2. The 
bounds in (ii) are straightforward. D 

4. An elimination process for quadratic words 

There is a general approach to solutions of equations in free and similar groups which 
may be called elimination process. It usually deals with pairs {E, a) where i? = 1 is an 
equation in a group F and a is its solution. On the set of such pairs, a certain set of 
transformations is defined. The idea is to reduce step-by-step the cancellation which appears 
after substituting a into E and then, starting from a given pair {E,a), to get a new pair 
{Ei,ai) of bounded complexity. In particular, solvability of equation E = 1 is reduced to 
existence of a solution of bounded complexity of finitely many such equations £"1 = 1 which 
can be algorithmically checked. To describe the solution set of -E = 1 one needs in addition to 
track transformations homomorphisms. (There are several ways to define them; for example, 
in the case of a free group F = F^ a. transformation homomorphism from a pair {E, a) to 
a pair {Ei,ai) can be viewed as an endomorphism : Faux — ^ E^ux such that E"^ = Ei 
and a = 0ai.) For free groups, an elimination process has been applied in its full strength 
by Razborov [25] to provide a description of solutions of an arbitrary system of equations in 
free groups. In the case of quadratic equations, there is a much simpler version in [51161 fTT]. 
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In this section, we apply a version of the ehmination process to pairs of the form (Q, a) 
where Q G Fx is a coefficient-free quadratic word and a is an evaluation of variables of Q, 
i.e. a homomorphism Fx -^ Fa- We use the process to give a bound on the size of some 
solution of a related quadratic equation and thus do not need to track transformation homo- 
morphisms. A more sophisticated version (with tracking transformation homomorphisms) 
will be used in Section E] to obtain bounds on the size of parametric solutions. 

Let {Q, a) be a pair where Q G Fx is a coefficient-free quadratic word and a : Fx -^ Fa 
is an evaluation of variables of Q. We define several types of elementary transformations 
which carry {Q,a) to a new pair {Qi,ai). In all cases, there will be a transformation 
homomorphism G End(Fx) such that Qi = Q'^ (but we do not require that a = ai(j)). As 
one of the main inductive parameters, we take the length of a formal word Q[a] obtained 
by substituting in Q values x°' of all variables x G Vai{Q) represented by freely reduced 
words. It is straightforward to check that |(5i[ai]| < IQlct]! for all elementary transformations 
introduced below. 

Degenerate transformation. Application condition: x" = 1 for some x G Var((5). We do 
not change a and apply the homomorphism (j) = {x ^ 1) to Q, so Qi is obtained from Q by 
removing all occurrences of x and performing subsequent cancellation. 

We introduce a natural notion related to this transformation. We say that a coefficient-free 
quadratic word Q2 is a homomorphic image of Qi if there is an endomorphism G End(-Fx) 
with Q{ = Q2. The following fact is an easy exercise. 

Proposition 4.1. Let Qi and Q2 be coefficient-free quadratic words. Then Q2 is a ho- 
momorphic image of Qi if and only if either Qi and Q2 are of the same orientability 
and genus{Q2) < genus{Qi) or Qi is non-orientable, Q2 is orientable and genus{Q2) < 
\genus{Qi). 

Cancellation reduction. Application condition: a non-trivial cancellation in Q[a\ occurs 
between the values of two neighboring variables x^ and y^ (e, 5 = ±1). There are two cases. 
Case 1: x ^ y. Then for some u, v and w ^ 1, 

(x^)° = uw and (/)" = w'h! 

where equality stands for graphical equality of words. We take a variable z ^ Var(Q) and 
define a transformation homomorphism by 

= (x^ t-)- x^z, y^ t-)- z~^y^). 

To define ai we set 

(a;^)°i = u, (/)"i = V, z"' = w 

and h"'^ = h" for all other variables h ^ x,y, z. 

Case 2: x = y. Then x'^y^ becomes (x^)^ and for some u and w 7^ 1, 



In a similar manner, we take 
and define ai by 



(x^)" = w ^uw. 



(x^ H- z ^x'^z) 



ix'T^ = M, 2"! = W. 



After application of this transformation we get a pair (Qi,ai) which satisfies the strict 
inequality |Qi[ai]| < |Q[«]|. 
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Splitting a variable. Application condition: x is a variable in Q with x" 7^ 1. We introduce 
a new variable y ^ Var((5) and apply to Q the substitution = (x 1— )■ xy). For ai, we take 
any homomorphism Fx — j- Fa such that x"^?/°^ = x", the product x'^^y"^ is reduced and ai 
coincides with a on all variables other than x and ?/. 

As an illustration, we prove a well known fact. 



Proposition 4.2 ([30]). Let w G -Fa be a value of a coefficient-free quadratic word Q. Then 
there exists a homomorphic image R of Q and an evaluation (5 : Fx — )■ Fa such that x^ ^ 1 
for all X G Var(i?), the word R[(S\ is freely reduced and equals to w. 

Proof. Let w = Q". We start with the pair (Q, a) and apply degenerate transformations 
and cancellation reductions until possible. For the resulting pair (i?, /3), R and /3 are as 
required. D 

Corollary 4.3. (i) Ifw ^ Fa is a value of a coefficient-free orientable quadratic word of 

genus g then w is a product of g commutators [ui, Vi] . . . [ug, Vg] with \ui\, \vi\ < 2\w\ 
for all i. 
(ii) If w & Fa is a value of a coefficient-free non- orientable quadratic word of genus g 
then w can be represented as a product uf. . . ul[uk+i, Uk+2] ■ ■ ■ [ug-i, Ug] where \ui\ < 
2\w\ for all i. 

Proof. Let Q be the quadratic word from the hypothesis and let w = v~^WiV where Wi 
is cyclically reduced. By Proposition 14.21 and Lemma 13. 12^ there is a standard orientable 
or semi-standard non-orientable quadratic word R and an evaluation (3 of variables in R 
such that i? is a homomorphic image of Q, R^ = wi and {x'^l < 2\wi\ for every variable 
X G Var(i?). If needed by we add to R extra commutators or squares with new variables 
y with values y^ = 1 so that R becomes equivalent to Q. The required u^s and Wj's are 
obtained by conjugating the values x'^ with v. D 

In what follows, we apply an elimination process to a pair (Q, a) where a is a solution of 
a standard quadratic equation written in the form 

(1) Q = 2i C1Z1Z2 C2Z2 ■ ■ ■ Z^ CmZm- 

Our goal is to find a solution of this equation satisfying the bound from Theorem 11.11 To 
do this, using elimination process we find first a short (in a certain sense) solution of an 
equivalent quadratic equation of the same form, for given fixed elements ci, . . . ,Cm G Fa- 
Then, by using an "economical" automorphism from Section [3] we reduce the equation to 
the ininial form thus producing the required "short" solution of the original equation ([1]). 

Instead of keeping fixed the element Q" during transformations as in the proof of Propo- 
sition 14.21 we will keep the property that Q" is a product of conjugates of fixed elements 
Ci,...,Cm G Fa (that is, w = Ui^CiUi . . .u^CmUm for some Ui,...,Um), up to changing 
exponent signs of Cj in the non-orientable case. 

Definition 4.4. Let w, Ci, C2, . . . , c^ G Fa be cyclically reduced elements of Fa and let w 

be a product of conjugates of Ci, . . . , Cm- We say that w is a short product of conjugates of 
Ci, . . . , Cm ii w, viewed as a freely reduced cyclic word, does not have a form w = uhvh~^ 
where h is non-empty and the set {c,} can be properly partitioned into two subsets {cp^} and 
{cg-} so that u is a product of conjugates of Cp.'s and f is a product of conjugates of Cg.'s. 
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Definition 4.5. (non-orientable version) Let w,ci,C2, ■ ■ ■ ,Cm € Fa be cyclically reduced 
elements of Fa- We say that w is a unsigned product of conjugates of ci, . . . , Cm if w is a 
product of conjugates of c^^, . . . , c^ for some ei, . . . , e^ = ±1- 

We say that w is a short unsigned product of conjugates of Ci, . . . , c^ if w, viewed as a 
freely reduced cyclic word, does not have a form w = uhvh^^ where h is non-empty and the 
set {cj} can be properly partitioned into two subsets {cp-} and {cq^} so that m is a unsigned 
product of conjugates of Cp-'s and u is a unsigned product of conjugates of c^-'s. 

It is not hard to prove that if w is a short or short unsigned product of conjugates of 
ci,...,Cm then then \w\ < X]k«l- This is essentially a consequence of the van Kampen 
lemma, see Lemma [5.51 below. 

Proposition 4.6. Let ci,C2,...,Cm G Fa and Q be a coefficient-free (orientahle or non- 
orientable) quadratic word of genus g. Suppose that the quadratic equation 

(cJ Z-^ C\Z\ . . . Zjyi^ CmZm 

has a solution in Fa- 

(i) If Q is orientahle then a short product of conjugates of Ci, - - - , Cm is a product of at 

most g commutators in Fa- 
(ii) If Q is non-orientable then a short unsigned product of conjugates of Ci, . . . , c^ is 
a product of at most g squares in Fa- 

Proof- We consider first the case of orientable Q- 

Let a be a solution of the equation from the hypothesis of the proposition. We describe 
a sequence of transformations starting with the pair [Q, a)- Any moment we will have 
a coefficient-free quadratic word R and a homomorphism /3 : Fx — > Fa such that R is a. 
homomorphic image of Q and R^ is a product of conjugates of Ci, . . . , c^- For the inductive 
parameter that will be decreased during transformations, we take the pair (|i?^|, |i?[/3]|) 
with lexicographic ordering. Recall that \R^\ denotes the length of the freely reduced word 
representing R^ and |-R[/3]| denotes the length of the formal word R[P]- 

If x^ = 1 for some variable x G Var(i?) then we apply the degenerate transformation 
(x I— >■ 1) to {R, (3) decreasing the number of variables in R- The element R^ and the formal 
word i?[/3] are not changed. 

If -R[/3] has a cancellation then we apply a cancellation reduction so that R^ is not changed 
but the length of R[l3\ decreases. 

If -R[/3] has a cyclic reduction then we change /3 by conjugating all the values x^ with the 
same element of Fa decreasing the length of R^ - Observe also that using this operation we 
can change R^ to its any cyclic shift not increasing parameter (|-R^|, |-R[/3]|). 

Thus we assume that -R[/3] is cyclically reduced (and hence is the cyclically reduced form 
of R^) and all variables x m. R have non-trivial values x^ ^ 1- 

Suppose that R^ is not a short product of conjugates of Ci, . . . , c^, that is, up to a cyclic 
shift we have R^ = uhvh~^ where u, v and h ^ 1 are as in Definition 14.41 Splitting variables 
if needed we may assume that h and h~^ are values of single variables in i?, that is, 

R = UxlVxi U^ = u, {xlf = h, V^ = V, {xif = h-\ 

We replace /3 by a new evaluation /3i such that the length of cyclically reduced form of R^ 
decreases and i?^^ is still a product of conjugates of Ci, . . . , Cm- After that we go back to the 
start of our procedure replacing i?^^ by its cyclically reduced form. There are two cases. 
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Case 1: Xi = X2. We define x^^ = 1 and leave the values of all other variables unchanged. 

Then i?^^ = (UV)'^ = uv and by the condition of Definition l4.4[ R^^ is a product of conjugates 

of ci,. .., Cm- 
Case 2: xi 7^ X2- Then Xi occurs either in U or in V . Suppose that U = t/iX^^f/2 (the 

case when xi occurs in V is similar). We take x^^ = {{U2U1YY ^^"^ V^^ — V^ ^r all other y. 

Then 

and hence R^^ is a product of conjugates of Ci, . . . , c^ but now we have |-R^^| < \R^\ — 2|/i|. 

The description of the transformation sequence is finished. After finitely many steps we 
get a pair (i?, f3) such that i?^ is a short product of conjugates of ci, . . . , Cm- Since _R is a 
homomorphic image of Q, it is a product of at most g commutators in Fx- This proves (i). 

In the non-orientable case, the argument is similar with the difference that we keep -R^ 
being an unsigned product of conjugates of ci, . . . , Cm- (In Case 2, if xi occurs in R twice 
with the same exponent e then some exponent signs of Ci,...,Cm are changed after the 
transformation.) D 

5. Unfolding Lyndon-van Kampen diagrams 

Using Proposition 14.61 and automorphisms from Section [3] it is not difficult to get a bound 
on the size of the shortest solution of a standard quadratic equation in F4. The bound is 
Nn{Q)c{Q) in the case of orientable Q and and Nn{QYc{Q) in the case of non-orientable Q. 
To improve this bound by factor n{Q) we prove an extra statement. Informally speaking, it 
says that a Lyndon-van Kampen diagram can be unfolded in an economical way. 

We recall some definitions and facts about Lyndon-van Kampen diagrams (or simply 
diagrams from now on for brevity). 

By a diagram we mean a finite 2-dimensional cell complex A embedded in the plane M^ 
and endowed with a labelling function A over an alphabet Y. The latter means that for 
any directed edge e of A the label A(e) is fixed which is either a letter in Y^^ or is empty. 
For any two mutually inverse directed edges e and e~^, we have A(e"^) = (A(e))~^. The 
labelling function is naturally extended to paths (viewed as sequences of directed edges) in 
the 1-skeleton A*^^^ of A. The label A(p) of a path p is a word in F^^ which we will often 
identify with an element of Fy. We call the label of the boundary loop of a 2-cell D of A 
the boundary label of A, defined up to a cyclic shift. 

We assume that all diagrams are connected and simply connected. We assume also that 
a diagram A is endowed with a fixed base vertex uq in the boundary of A and, moreover, 
a boundary loop of A at i/q is fixed. (In general, the boundary loop starting at a given 
boundary vertex may be not unique). The label of the fixed boundary loop of A is called 
the boundary label of A. 

We admit that the boundary label of a 2-cell D oi a diagram A is the empty word or 
has the form yy~^, y & Y. In this case we call D a trivial 2-cell of vertex or edge type, 
respectively. Otherwise a 2-cell is called nontrivial. The boundary label of a nontrivial 2-cell 
is always assumed to be cyclically reduced. 

If words Ci, . . . , Cm are boundary labels of all nontrivial 2-cells of A then we call A a 
diagram over the set {ci, . . . , Cm}- (To be more formal, {ci, . . . , Cm} should be viewed as a 
multiset since we admit that some Cj are repeated.) 
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Let w,Ci, . . . ,Cm G Fy. A variant of the van Kampen lemma says that w is a product of 
conjugates of ci, . . . , c^ if and only if there is a diagram A with labelling function over Y 
over the set {ci, . . . , Cm}, with boundary label w. The proof can be found in [211 proof of 
Theorem V.1.1]. Note that cut off operations of spherical diagrams in the construction of A 
can be avoided since we admit empty labels of edges and trivial 2-cells. 

We need a precise description of the process of constructing a diagram from a represen- 
tation of an element w as a product of conjugates of Ci, . . . , Cm- We start with describing 
several elementary operations applied to a given diagram A which produce a new diagram Ai 
over the same set {ci, . . . , Cm}- 

(Tl) Contracting a trivial edge. If e is an edge with distinct endpoints and A(e) = 1 then 
we contract e into a vertex. 

(T2) Contracting a trivial 2-cell. Let D be a trivial 2-cell of A. Assume that either D has 
vertex type and the boundary loop consists of one edge or D has edge type and the boundary 
loop of D consists of exactly 2 edges and is labelled yy~^, y eY . Then we contract D to a 
vertex or to an edge labelled y, respectively. 

We admit also inverse operations (Tl)~ and (T2)~ , introducing a trivial edge and intro- 
ducing a trivial 2-cell, respectively. We call operations (Tl), (T2) and their inverses trivial 
transformations. We introduce another two types of elementary operations which we call 
elementary reductions. 

(Rl) Folding boundary edges. Assume that two distinct boundary directed edges ei and 62 
of A have the same label, a common initial vertex v and distinct terminal vertices. We 
assume furthermore that the path (6162)^^ occurs in the boundary loop of A. (This is not 
always true in the case when v is the base vertex of A.) Then we perform folding of ei and 62 
into a single edge. 

(R2) Removal of a leaf edge. Let e be an edge of A with an endpoint v of valence 1. We 
assume that v is not the base vertex of A. Then we remove e and its endpoint v from A. 
Note that e does not belong to the boundary of a nontrivial 2-cell of A since boundary labels 
of nontrivial 2-cells are assumed to be cyclically reduced. Therefore, this operation either 
reduces cancellation in the boundary label of A or changes a trivial 2-cell of edge type to a 
trivial 2-cell of vertex type. 

By definition, for all the operations introduced, the base vertex and the boundary loop 
of Ai are inherited in the natural way from those of A. 

Observe that (Tl) and (T2) do not change the boundary label w of A, (Rl) reduces a 
cancellation in w and (R2) either does not change w or reduces a cancellation in w. 

It is easy to see that any cancellation in the boundary label of A can be reduced by either 
(Rl) or (R2) after a sequence of trivial transformations (Tl) and (T2) 



Definition 5.1. We say that a diagram A2 is obtained by folding from a diagram Ai if it is 
the result of application of a sequence of trivial transformations and elementary reductions 
so that the boundary label of A2 is freely reduced. We say also that Ai is obtained by 
unfolding from A2. 

For technical convenience, we assume that after folding, diagram A2 always undergoes 
the following tightening procedure: First, contract any trivial 2-cell whenever it can be 
contracted by using (T2) and any sequence of operations (Tl) . Second, contract all trivial 
edges whenever possible. 
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Below we use the following properties of tight diagrams as in Definition 15.11 The proof of 
the following lemma is an easy exercise and left to the reader. 

Lemma 5.2. Let A be a tight diagram. Then the following assertions are true. 

(i) All trivial 2-cells of A have edge type. If a nontrivial edge e occurs in the boundary 
loop of such a 2-cell D then both e and e~^ occur in the boundary loop of D (so the 
union of D and e is an annulus). In particular, no non-trivial boundary edge of A 
belongs to the boundary of a trivial 2-cell. 

(ii) Any simple loop in the 1-skeleton A*^^^ of A bounds a subdiagram which has at least 
one nontrivial 2-cell. 



With a formal product 



V = U^ ^CiUi . . . U^CmUm 



we associate a rose diagram Aq over the set {ci, . . . , Cm} labelled with (perhaps, not reduced) 
word u, see Fig. [HI Using folding we can produce a new diagram A whose boundary label is 
the freely reduced from of v. (Existence of such a diagram A is essentially the main part of 
the van Kampen lemma.) 




Figure 8. 

We can consider also an inverse process: given a diagram A over a set {ci, . . . , Cm\ with 
boundary label w we can unfold A to get a rose diagram Aq and its associated factorization 

of w in the free group Fy where a is a permutation on the set {1, . . . , m} and c[ is a cyclic 
shift of Cj. The following proposition describes the relationship between two factorizations 
obtained in this way. 

Proposition 5.3. Let A be a diagram with labeling function overY, with boundary label w. 
Let Ai and A2 be two rose diagrams obtained from A by unfolding and let 

W = U^^CiUi . . . U;^CmU^ = t^;(l)C;(i)i;^(l) . . . %l^f'a(m)VaM 

be the associated factorizations of w in Fy where a is a permutation on the set {1, . . . , m} 
and c[ is a cyclic shift of Ci. Then there is an Fy- automorphism cj) o/-Fyu{x;i,...,z„} such that 

\Zi CiZi . . .Z^ CmZm) = ^o-(l)'^o-{l)^o-{l) • • • ^(j{m)^o-{m)^o-{m) 
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and the following diagram is commutative 

Fyu{zi,...,z^} ^ Fyu{zi,...,z„,} 





Fy 

where a and /3 are the corresponding evaluations of variables Zi: 

zf = Mi, z^ = Vi (z = 1, . . . , m). 

Proof. Let vq be the base vertex of Ai and i^i, . . . , z/^ be vertices where disks with boundary 
labels Ci are attached to the other part of Ai (shown in Fig. [8]). Let Zi be the simple path 
between Vi and vq and di the boundary loop of the corresponding 2-cell at Vi (which is 
labelled q). We keep similar notations u[, z[ and d^ for A2. 

Unfolding transformations A — ?■ Aj induce a homotopy equivalence ■?/' between 1-skeletons 
A^ and A2 such that ^(i^o) = V^(^o)- -'^^ particular, we have the induced isomorphism 
7ri(A-j^ \vq) — )■ vri(A2 , z/q) of fundamental groups. Moreover, from the construction of ele- 
mentary operations it is easy to see that ■?/' satisfies the following conditions: 

(i) For any loop p at z/q, the image V'(p) is a loop at v'^ labelled with a word representing 

the same element of Fy. 
(ii) t(j{z~^diZi) has the form w~^d[wi for some Wi up to homotopy in Ag rel z/q. 
(iii) The image il){t) of the boundary loop t of Ai is homotopic in Ag rel u'^ to the 

boundary loop of A2. 

The path z[~^Wi is a loop at i^q, so it has an expression (up to homotopy rel z/g) 

in terms of the generators z[~^d[z[ of 7ri(A2,z/Q). We define formally a homomorphism ip : 

-F{di,...,d™,^i,...,2„} -> -F{<,...,rf',„,4,...,2;,} by 

i){di) = d[ and ^(2;^) = 2;^7i- 

The definition implies that for any loop p at z/q, we have ip{p) = ip{p) up to homotopy rel v'^. 
From this we conclude that all z'~^d'^z'^ belong to the image of il) and hence if) is in fact an 
isomorphism. From (iii) we get 

^-1^ ~ ~-l^ ^ A ^'-1 rl' -y' -y'-l rl' -J 

Zi aiZi ■ ■ ■ Z^ UmZm —^ ^o-(l)"cr{l)'^o-(l) • • • -^(T(m)"o-(m)''cr(m) 

Now we adjust ?/; to get the required (p. By (i) and (ii), we have 

K^i) = gAiwi) for some gi e Fy with g'^Cigi = c- 

We define by 

(j){zi) = giZifi{z:[^c[zu • • • , z-^c'^Zm)) 
It is easy to see that (j) satisfies all the required conditions. D 

To formulate the main result of the section, we need two more definitions. Let e be 
a nontrivial edge of a diagram A. We call e a tree edge if both e and e~^ occur in the 
boundary loop of A (or, in other words, A splits into two disjoint subdiagrams after removal 
of e). We call e tubular if e and e~^ occur in the boundary loop of a trivial 2-cell of A. 
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Proposition 5.4. Let w,ci,C2, ■ ■ ■ ,Cm be cyclically reduced elements of a free group Fy- 
Let A be a diagram with boundary label w over the set {ci, . . . , Cm} and let N denote the total 
length of labels of tree and tubular edges of A. Assume that A is tight as in Definition \5.1\ 
Then there is rose diagram obtained from A by unfolding such that for the associated 
factorization 



w = t'l ^c'^(l)Vl ■ ■ ■ vJc'^^^^)V^ 



where a is a permutation on the indices {1, . . . , m} and c[ is a cyclic shift of Ci, the following 
is true: 

(i) \vi\<j:T=i\ci\+'2N forallt. 
(ii) for any increasing sequence 1 < ti < t2 < ■ ■ ■ < tk < m of indices ti, 

m 

Proof. Suppose that e is an edge in the interior of A such that at least one of the endpoints 
of e belongs to the boundary of A. Then we can apply to A an unfolding operation (Rl)~ 
replacing e by two new boundary edges. 

Starting from A, we perform recursively unfoldings of all interior edges. Then we contract 
all trivial 2-cells using (TI) and (T2). The resulting diagram A has no interior edges and 
no trivial 2-cells. For convenience, we further introduce new trivial edges where needed, so 
that each vertex of A gets valence at most 3 (see Fig. [H]) 





Figure 9. 

We claim that A has exactly A^ tree edges. Indeed, since A is tight, by Lemma I5.2( i) 
any nontrivial interior edge of A either occurs in the boundary loop of a nontrivial 2-cell 
or is tubular. Then every tree edge of A comes either from a tree edge of A (which is not 
changed in the transformation) or from a tubular edge e of A (which becomes a tree edge 
after unfolding e and contracting the trivial 2-cell D with dD D e). 

Let z/i be the base vertex of A and £ be the boundary loop of A. We enumerate cells 
Ri, . . . , Rm of A in the order as their boundary edges meet first in L Let 

{vq} = Ao C Ai c ■ ■ ■ C A„ = a 

be a sequence of subdiagrams of A where each Aj is obtained from Aj„i by attaching the 
topological closure of Ri and a segment Sj joining Ri with Aj_i. We view Sj as a path from 
a vertex /ij in dRi to a vertex ^j in 9Aj_i. The boundary loop of Ri at /ij is labelled with 
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a cyclic shift c'^r^-. of Ca-(j) where a is a permutation on {1, . . . ,m}. Let rj be the terminal 
segment of the boundary loop of Aj„i starting at ^j. Denote Vi = A(sjrj). Then we have 

and from the construction we can easily see that the rose diagram associated with this 
factorization is obtained by unfolding from A successively along paths r^, ?"m-i, ■ ■ ■ , '^i each 
time slicing off one 2-cell Dj. 

Observe that s^rj passes through every tree edge of A at most twice and through every 
other edge of A at most once. This implies 



<^|q|+2A^. 



i=l 

Let us prove (ii). Let 1 < ti < ^2 < ■ ■ ■ < ^fc ^ '^ be an increasing sequence of indices tj. 
Let hi be the component of the intersection dRi fl £ that meets first in L We consider a 
diagram A' obtained by removal from A all 2-cells Ri and all arcs hi with i ^ tj. It is easy 
to see that the boundary label of A' is equal to nj=i "^fT^^aft )^*i- Hence 

k 



X{'"u<^o{ufu 



=1 



< ^|ci| +2N 

i=l 



D 



For the proof of Theorem 11.11 we will use Proposition 15.41 in the special case when w is a 
short product of conjugates of elements Ci, . . . , c^ in the sense of Definition 14.41 We start 
with the following simple observation. 

Lemma 5.5. Let w, Ci, C2, . . . , c^ be cyclically reduced elements of a free group Fy and w 
be a short product of conjugates of ci, . . . , Cm.. Then any diagram A over {ci, . . . , Cm} with 
boundary label w has no tree edges. In particular, \w\ < ^^ |cj|. 

Proof. If e is a tree edge of A then the boundary loop of A, up to a cyclic shift, has the form 
peqe~^ where p and q are loops which bound two subdiagrams Ai and A2 of A. Since w is 
cyclically reduced, the labels of p and q are nontrivial elements of Fy and hence both Ai 
and A2 have nontrivial 2-cells. We get a factorization of w as in Definition 14.41 which 
contradicts the assumption that w is short. 

To prove the second statement, we take a tight diagram A over {ci, . . . , Cm} with boundary 
label w. By the first statement and Lemma [5.2( i). any boundary edge of A belongs to the 
boundary of a nontrivial 2-cell. The total length of their boundary labels is Yli \<^i\- ^ 

Corollary 5.6. Let w,ci,C2, . . . ,Cm be cyclically reduced elements of a free group Fy and 
let w be a short product of conjugates of Ci's. Then, up to re- enumeration and cyclic shifts 
of Ci 's, there exists a factorization 

such that \ui\ < Xli^i kd '^'^'^ /^'^ '^'^?/ increasing sequence of indices 1 < ti < t2 < ■ ■ ■ < 
tk < m. 



\u^^^ct,ut,...u^^ct^ut^ <^ 
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Proof. Take any diagram A over {ci, . . . ,Cm} with boundary label w. Assume that e is 
a tubular edge of A. Let D be the trivial 2-cell of A with boundary loop peqe~^ where 
X{p) = X{q) = 1. One of the loops p or q bounds a subdiagram Ai of A. Then we can 
remove the annulus D U e from A replacing the subdiagram Ai U D U e by Ai. 

Continuing this process we may assume that A has no tubular edges. It remains to apply 
Lemma [5.51 and Proposition 15.41 D 

Proof of Theorem \l.l\ Assume that a quadratic equation Q = 1 is solvable in F^. Let 
ip G Auti;'^(F4ux) be an automorphism given by Proposition 13.31 or Proposition 13.51 such 
that Q"^ is conjugate to a standard orientable or semi-standard non-orientable quadratic 
word R. If q; is a solution of i? = 1 then tfja is a solution oi Q = 1 and for any x G Var((5), 

\x^''\ < 4n(Q) max |x"| + 2c{Q). 

a:GVar(Q) 

We have n{R) < n{Q) and c{R) < c{Q) and we can always assume that x" = 1 for every 
X e Var(Q) \ Var(i?). This implies that the statement of the theorem for Q follows from the 
statement for R. Therefore, it is sufficient to prove the theorem in the case of a standard or 
semi-standard Q. Let 

Q = [Xi, yi] [X2, 2/2] •• • [Xg, yg\ClZ2^C2Z2 . . . Z'^CmZm 



or 



Q = X^X2 . . . XfcfXfc+l, Xfc+2] . . . [Xg-1, Xg]ciZ2 C2Z2 

We can further assume that all Cj are cyclically reduced. Indeed, in case of general Cj's 
we consider the equation Q = 1 where each Cj is replaced with its cyclically reduced form 
Ci = u~'^CiUi. A solution oi Q = 1 can be obtained from a solution oi Q = 1 hj the 
substitution 

{xi i-> UiXiU^'^, yi I-)- UiyiUi'^, Zi ^^ UiZiU^^ for all i). 
It is easy to see that the statement of the theorem for Q = 1 follows from the statement for 
Q = l. 

Now consider two cases. 

Case 1: Q is standard orientable, i.e. 

Q = [Xl,yi][x2,y2] ■ ■ ■ [Xg,yg]CiZ2^C2Z2 



By Proposition 14. 6[ there exists a solution /3 of the equation 

[Xl, yi\[x2, ^2] • • • [Xg, yg]z^^CiZi . . . Z'^CmZm = 1 

such that the cyclically reduced form w oi {z^^ciZi . . . z^^CmZmY is a short product of con- 
jugates of Cj's. In particular, \w\ < c{Q) by Lemma [5751 By Corollary 15.61 there exist a 
permutation cr on {!,..., m\ and elements Mi, ... , Um G Fa of length |Mi| < '2c{Q) such that 

w = n]~^c^(i)Ui . . . n~^c<^(„)nm 

and for any increasing sequence of indices 1 < ti < ^2 < ■ ■ ■ < tfc < m, 

\u7,^c^{h)Ut, ■ . .Mt"/cafe)"*J ^ c(g). 

This implies by Lemma [3.131 that there are elements fi, . . . ,fm of length \vi\ < 3c((5) such 
that 
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By Corollary 14. 3[ w can be represented as a product of g commutators of elements of length 
at most 2c{Q). Hence we get a solution 7 of the equation 

(2) [xi, yi][x2, 1/2] •• • [Xg, yg]z^^cizi . . . Z-^CmZm = 1 

such that \x]\, \y]\ < 2c{Q) and \zj\ < 3c{Q). To get a solution of the original equation Q = 
1, we eliminate zi by applying to the left-hand side of ([2]) the automorphism tp G Aut{FAux) 
defined by 

V^ = (zj I— )■ Zi^XiZi, yi I—)- z^-^yiZi for all i, Zj 1— )■ ZjZi for j > 1). 

This gives a solution ip'^l of Q = 1 with \x^ '^\ < 8c{Q) for all x. 
Case 2: Q is semi-standard non-orientable, i.e. 

Q = X1X2 ■ ■ ■ Xj^yXk+l^ Xk+2\ ■ ■ ■ Fg_l, Xg\CiZ2 C2Z2 ■ ■ ■ Z^ CmZm- 

Similarly to Case 1, for some ei, . . . ,em ^ {~1; 1} we find a solution 7 of another semi- 
standard equation 

Xi . . . Xj.[Xr+l, Xr+2\ ■ ■ ■ pg— 1) -^gl^l ^l ^1 • • • ^m—l^rn^rn ^ -1- 

where \xj\ < 2c{Q) and {zJl < 3c{Q) for all i. By Lemma [3.141 there is an automorphism 
(j) G Aut(-F4ux) carrying its left-hand side, up to conjugation, to a word 

Ql = Xi . . . Xf^[Xk+l, Xk+2] ■ ■ ■ [Xg~l, Xg]Zi C1Z1Z2 C2Z2 ■ ■ ■ Z^ CmZm 

which produces a solution 71 = 0~^7 of Qi = 1 with 

\x^' I < (8^ + 12m + 2)c(Q) for all x. 

Finally, we eliminate zi as in the orientable case which at most triples the bound, and observe 
that 8^ + 12m + 2 < 12n(g). D 

6. Bounding parametric solutions 

In this section we prove Theorem 11.21 

As defined in Section [H by a parametric solution of an equation i? = 1 in a free group Fa 
we mean an FA-homomorphism r] : -F^uVar(£) — ^ -^aut such that E^ = 1 where T is a 
set of parameters. Here instead of the "big" group Faux we consider the group FAuva.r{E) 
involving only variables occurring in Q. It will be convenient to change this point of view 
by introducing formal sets of variables for equations. We assume that an equation E = 1 is 
endowed with a formal finite set of variables V G X such that V ^ Var(i?) (in other words, 
we admit now fictitious variables x & V not occurring in £"). A parametric solution of such 
an equation {E = 1,V) is then an F^-homomorphism /3 : Fauv ~^ ^aut such that E^ = 1. 
If V^ = Var(£') then we get equations and their parametric solutions in the initial sense. 

Transformations of equations are no longer FA-automorphisms (or FA-endomorphisms if 
degenerate transformations are allowed) of the big free group Faux but homomorphisms 
(j) : Fau^ — )■ FauVi where V,Vi C X are finite sets of variables. Since we want (p to be 
"potentially invertible" we require that be a monomorphism. Moreover, we will require 
that the condition given by the following definition should be satisfied. 

Definition 6.1. We call an FA-monomorphism : Fauv — > -^AuVi primitive if the image 
of is a free factor of Fauvi • 
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The main ingredient to the proof of Theorem 11.21 is the following proposition which may 
be viewed as an advanced form of Proposition I4.6[ 

Proposition 6.2. Let {ci, . . . , c^} be a finite set of cyclically reduced elements of Fa- Sup- 
pose that f] : -F4uVar(Q)uz -^ Faut is a parametric solution of a quadratic equation in Fa of 
the form 

\c^ Z-^ C\Z\ . . . 2^j^ CmZm 

where Q is a coefficient- free quadratic word and Z = {zi, . . . , Zm}- 

Then there is a coefficient-free quadratic word R equivalent to Q, a finite set of variables 
V 3 Var(i?) and a parametric solution 9 : Fauvuz ~^ Fa\jt of an equation 

where all Si = 1 if Q is orientable, Si G { — 1,+1} if Q is non-orientable and a is a permu- 
tation on {1,2,..., m}, such that the following assertions are true: 

(i) There is a primitive FA-monomorphism : -F4uVar(Q)uz -^ Fauvuz and an Fa- 
endomorphism u G Endp^{FA[jT) such that 

(,V Z^ CiZi . . . Z,^ C^^Zjji) 

is conjugate to 

and the following diagram is commutative: 

-^AuVar(Q)UZ ^ FauVUZ 

V 

Faut -^ — Faut 

(ii) Let R denote the word obtained by removing from R all variables x with x^ = 1 and 
performing all subsequent cancellations. Then R[9] is cyclically reduced (and hence 
R[6] is the cyclically reduced form of R^ ; recall that W[6] denotes the formal word 
obtained after substitution in W of values x^ of all variables x). 
(iii) There is a Lyndon-van Kampen diagram A with boundary label R^ over the set {c^i , 
. . . , c^"} folded from the rose diagram associated with factorization 

such that any tree or tubular edge of A is labelled by a sigle parameter letter and the 
total number of tree and tubular edges is less than m. 

Proof. We consider first the case of orientable Q (so all Si are 1). 

We describe a certain transformation process. At any moment, we will have the following 
data: 

• A quadratic word R* of the form 

R* = R Z^(i)C<^(l)2<x(l) . . . Z^^^^C^^rn)Za{ni) 

where i? is a coefficient-free quadratic word equivalent to Q; 

• A finite set V of variables such that V 3 Var(i?). 

• A parametric solution 9 : Fauvuz -^ Faut of the equation R* = 1. 
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• A primitive FA-monomorphism : FA\jwax{Q)\jz -^ Fauvuz and an endomorphism 
CO E Endi?^ (Faut) satisfying (i). 

We start with Rq = Q~^z^^ciZi . . . z^^CmZmi Vo = Var((5) and Oq = t] given by the 
hypothesis of the proposition. For and u, we take the identity maps. 

A principal distinction from the proof of Proposition 14.61 is that we cannot use degenerate 
transformations now. We can have variables x with x^ = 1 which we call degenerate. Instead 
of |-R[^]| as one of the inductive parameters, we use |-R[^]| where R is obtained by removal 
of all degenerate variables from R and performing subsequent cancellations. Variables x G 
Var(i?) \ Var(^) which are not degenerate are called cancelled. 

Our inductive parameter now is the pair (|-R^|, |-R[^]|) with lexicographic ordering. The 
whole transformation process consists of steps 1-5 described below. 

We start with describing several elementary transformations of triples {R*,V,6) of the 
described form. There will be two types of them. 

A substitution is given by a new set of variables Vi 3 V, an F^-monomorphism ip : 
Fauvuz —^ FauViUZ and a homomorphism 6i : FauViUZ -> Faut such that 9 = ip6i. The 
new quadratic word R^ is defined as the cyclically reduced form of (R*)^. In most cases 
when substitutions are defined, we change only R and do not change the coefficient part 

A generalization is given by an endomorphism r G Endi?^(F^uT) and a new parametric 
solution 9i oi R* = 1 such that 6 = 6it. In this case we get a new triple {R^, Vi, 6i) where 
Rl = R* and Vi = V. 

Note that in both cases existence of and u satisfying (i) for a triple {R*, V, 9) automat- 
ically implies one for the new triple (i?^, Vi.,9i). So we will not care about condition (i). 

Transferring cancelled subwords. Application condition: a word of the form x'^D occurs 
in R where x G Var(^) and D disappears in R. We apply the substitution ip = {x'^ ^ x'^D^^) 
to R and do not change V and 6. The transformation transfers D to another location in R. 
(Observe that x^Dx~^ cannot occur in R since x would be cancelled otherwise.) 

Cancellation reduction. Application condition: a non-trivial cancellation in R[9] occurs 
between the values of two neighboring variables x'^ and y^ (5, 5 = ±1) and x'^y^ occurs also 
m. R. li X ^ y then for some u, v and w ^ 1, 

{x'^Y = uw and {y^Y = w~^v 

We introduce a new variable z ^ V, take Vi = V^ U {2;} and define ip and 61 by 

ijj = {x'^ h^ x'^z, y^ H- z~^y^) 

and 

[x^y = u, {y^' = V, z^' = w and h^' = h^ for h y^ x,y, z. 

The case x = y is treated in a similar way (see the proof of Proposition 14. 6p . 

Observe that both operations do not change R^, transferring cancelled subwords does not 
change also R[6] and cancellation reduction decreases the length of R[6] by 2\w\ where w is 
the cancellable part. 

Step 1. Assume that R[6] has a cancellation between the values of two neighboring 
variables x"^ and y^. Then a word of the form x'^Dy^ occurs in R where D disappears in R. 
We transfer D to another location so that x^ and y^ become neighbors in R and then apply 
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cancellation reduction decreasing |-R[6']|. We repeat the procedure until R[6] becomes freely 
reduced. 

To make R[9] cyclically reduced, we use one more transformation. 

Conjugation. Take a new variable y ^ V, take V^i = V U {y} and define ip '■ ^auvuz -^ 
FauViuz by 

ip = {x \-> y~^xy for x G Var(_R), Zi -^ Ziy ioi i = 1, . . . , m) 

To define ^i, we take any element u G -Faut for the value y^^ of y and set according to 
equality 6 = ipOi. 

x^^ = ux^u~^ for X G Var(i?) and z^^ = zfu, z = 1, . . . , ?7i. 

For this transformation, we have R^^ = u~^R^u. 

Step 2. If R^ is not cyclically reduced then using conjugation we replace it with its 
cyclically reduced form. Then jump back to Step 1. 

We can assume now that R and 9 satisfy condition (ii). In the rest of the proof, we show 
how to achieve (iii). We observe for the future that using conjugation we can replace R[9] 
with any its cyclic shift not increasing the inductive parameter (|-R^|, |-R[^]|). We introduce 
yet another transformation. 

Splitting a variable. Let x G Var(^). By the definition of R we have x^ ^ 1. Take a 
new variable y ^ V, take Vi = V U {y} and apply to R the substitution ■?/; = (x i— )• xy). 
To define 6i, we take any values x^^ and y^^ such that the product x^^y^^ is reduced and 
equals x^. The values of all other variables are unchanged. 

Starting from now we fix any diagram A with boundary label R[6] over the set {ci, . . . , Cm} 
folded from the rose diagram associated with factorization ([3]). Transformations below will 
include also change of A. 

We introduce a transformation which changes the coefficient part. 

Rearranging coefficients. Let Aq be a rose diagram obtained from A by unfolding, and let 

be the associated factorization of R[6] into a product of conjugates of q's. (Note that we can 
always assume that a cyclic shift of q coincides with q in this factorization, by performing 
extra unfolding operations on the rose diagram.) 

By Lemma [5. 3[ there is an automorphism tp G Aut(F4uyuz) changing only variables in Z 
such that 

-1 -1 i' -1 -1 

^1 CiZi . . . Z,^ CfYiZfYi — >■ 2;^^->Co-(l)^cr(l) • • • ^o-(^)Co-(m)^(T{m) 

and for 9i = ip~^9 we have ^f^ = fj. We apply the substitution ip to get a new triple 
{R*, V, 6i) where the solution 6 and the coefficient part in R* are only changed. 

Step 3. Let e be a tree edge of A and let h = A(e) 7^ 1. The edge e divides A into the 
union A = Ai U e U A2 of e and two subdiagrams Ai and A2. Passing to a cyclic shift of 
R[6] if needed (which can be performed using conjugation) we assume that 

R[e] = uhvh-^ 

where occurrences of h and h~^ are the labels of e and e~^, respectively. Splitting variables 
if necessary we may assume that h and h~^ are values of single variables, that is, 

R = UxiVxi U^ =u, x\ = h, V^ = V and {xlf = h~\ 
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Note that U and V are labels of boundary loops of subdiagrams Ai and A2. The set of 
2-cells of A is partitioned into the set of 2-cells in Ai and in A2. Let 

U^ =W-^Ci,Wi,...wr^Ci^Wi^ 

and 

where 

{1, 2, . . . , m} = {zi, . . . , Zfc} W {zfc+i, ...,im] 
be factorizations obtained from unfoldings of Ai and A2. Then 

is a factorization associated to an unfolding of A. We apply first rearrangement of coefficients 
so that equation R* = 1 gets the form 

R = EF where E = zy^c^Zi, . . . z^^a, Zi., F = z~^ Ci.^.Zi.^, . . . z^^a Zi 

and 

E^ = U^ = u, F^ = hV^h^^ = hvh~\ 

Consider two cases. 

Case 1: Xi = X2- Since a;f = {x2 )^ 7^ 1 we have 6 = —1. 

We apply a generalization transformation which replaces h with a new parameter t. The 
new parametric solution 61 is defined by 

Zj H- Zjht~^ for j = ifc+i, ...,ini 

y^y for y G Var(i?) \ {xj and y = Zi^, . . . ,Zi^ 

In A, we change the label of e to t. Note that we do not change the inductive parameter 
(l-R^I, |-R[6']|) since h is the label of a single edge e and hence \h\ = 1. 

After performing the transformation we start a new iteration of Step 3 checking for another 
tree edge of A. In general, after splittings of variables new tree edges may appear in A. 
However, the total length of labels of tree edges is not changed. Therefore, after finitely 
many iteration steps we either process all tree edges of A or come to Case 2 where the 
inductive parameter decreases. 

Case 2: xi 7^ X2- Then xi occurs either in U or in V. Without loss of generality we 
assume that Xi occurs in U (if Xi occurs in V then we can pass to a cyclic shift of R[6] by 
conjugation and then come to a symmetric situation). Let U = UiXi^U2- 

First we apply the substitution 

i'l = {xi H-)- xiE'^Ui). 
We get 

R-^EF = x^^V-^x^^U^^xiU^^EF 

^ x^^V^^U^^Ex^^U^^XiF 

For the new value x^ of Xi we have 

xl' = {x.U^^EY = U^ 
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Next we apply another substitution 

^2 = (^ij ^ Zi^X];^U2Xi, j = l,...,k) 

to get 

x^^V-^U^^Ex^^U^^XiF % x^^V-^U^^x^^U^^XiEF. 
After application of ipi and ip2 we get a new triple {Rl, V, 6i) with Ri = Xi^U2XiUiVx\ and 

R\' = {U2UiVxif 
which implies 

\Rt'\<\R% 

We jump next to Step 1. Step 3 is finished. 

At this point, we produce a triple [R*, V, 9) satisfying (ii) and a diagram A with boundary 
label R[6] over the set {ci, . . . , Cm} folded from the rose diagram associated with (|3]). The 
diagram A has the property that the label of any its tree edge e is a single parameter 
letter t. Moreover, there are exactly two occurrences of t in R[9] and both are the values of 
one variable x G Var(^). 

Step 4. Let e be a tubular edge of A. Let D be the trivial 2-cell with boundary loop 
epe^^q where p and q are loops with empty labels and p bounds a subdiagram Ai of A. 
Without loss of generality we assume that there are no tubular edges in Ai. We choose any 
non self-intersecting path s joining the base vertex uq with the start of e and unfold A along 
the path sepe~^s~^ as shown in Fig. [TD] (note that D is contracted into an edge after this 
procedure). Then we perform folding of the resulting diagram reducing cancellation in the 
newly appeared copies SiCiPicf^sj"^ and S2e2P2ej^^sJ^ of sepe~^s~^ (Fig. (TD]). The resulting 
diagram A' is the union of two subdiagrams G and A2 with 0n A2 = {^^o}- The subdiagram 
G is obtained from the union of Ai and SiCi by adding trivial 2-cells so that the complement 
G — A consists of annuli formed by the trivial 2-cells and tubular edges that become all 
edges of the path siCi. The subdiagram A2 is obtained from the complement A — Ai by 
contracting the path q into a vertex (to do the contraction, we introduce trivial edges where 
needed to remove self-intersections in q). 

Observe that the boundary label of G is empty and the boundary label of A' is equal 
to the boundary label of A. Since both A and A' can be unfolded form a common rose 
diagram Aq we can perform rearranging coefficients so that factorization (^ is replaced by 
the factorization associated to Aq. We replace also A with A'. 

Similar to Step 3 we further replace occurrences of the label of SiCi in the values x^ of 
variables x by a new parameter letter (only values of variables Zi are changed for which the 
corresponding coefficient Cj is the boundary label of a 2-cell in Ai). 

We perform the described procedure for all tubular edges of A. After this, A becomes 
the union of subdiagrams A and Gi, . . . , G,- with a common vertex z/q and having no other 
intersections. Each subdiagram Gj has empty label and a single tubular edge labelled by a 
parameter letter. It is not hard to see that after the whole procedure no tree edges appear 
in A. 

Step 5. The final procedure is elimination of tree vertices of A, that is, vertices that do 
not belong to the boundary of any 2-cell of A. 

34 





SiCi 




Figure 10. 



Let 1/ be a tree vertex of A. Since R[6] is cyclically reduced, the valence of i> is at least 2. 
We assume without loss of generality that z/ is distinct from the base vertex uq of A (otherwise 
using conjugation we can move z/q to any non-tree boundary vertex of A). 

Let ei, . . . , e^ be all directed edges starting at z/, and let Xi, . . . ,Xd be the corresponding 
variables, so x^ = tj is a parameter letter which is the label of Cj. Since u ^ uq, all occurrences 
of variables Xi in R are of the form . . . x~^XiJ^i ... (i mod d). Hence all occurrences of Xi 
in R have the form . . . x~^DiXi+i . . . [i mod d) where all Di disappear in R. We first apply 
to R the substitution 



(x2 H- D^ ^XiX2, xs i-> D^ ^-Di ^XiXa, . . . , Xk^ D^^ . . .D^ 



XiXk) 



This eliminates X2 from R (it either disappears in R or becomes cancelled in R). For the 
new solution 9i we get x^^ = titi ior i = 2, . . . , d. All occurrences of parameter letters ti in 
the values of other variables (which can be only Zj^s) are of the form . . . t~^ti+i . . . [i mod d). 
Hence we can perform a generalization replacing each titi with a single parameter ti. 

After performing this operation, the length of R[9] is decreased. We get also a new 
diagram Ai which is obtained from A by contracting the edge ei into a vertex. 

We repeat the procedure until we get rid off all tree vertices of A. 

The description of the transformation process is finished. Let A be a diagram obtained 
after all steps 1-5. To prove the proposition, we have only to estimate the total number of 
tree and tubular edges of A. 

Let ei, . . . , Cfc be all tree edges of A and let r be the number of subdiagrams 6, with 
empty label produced at Step 4. Each of the subdiagrams 0j has at least one non-trivial 
2-cell since otherwise it should be contracted to a vertex. The same is true for all connected 
components of the complement A — UjCj. This implies that the number of these connected 
components is at most m — r. On the other hand, this number is precisely k + 1 since each e^ 
joins two components of A — UjCj and A is simply connected. Hence k + r < m. 

The proof of Proposition 16.21 is completed in the case when Q is orientable. In the case of 
non-orientable Q the only difference in the argument is that we admit inerses of coefficient 
factors. (In Case 2 at Step 3, it may happen that xi occurs in R twice with the same 
exponent +1 or —1. Then the corresponding transformation inverses E which occurs in the 
coefficient part.) D 
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Proof of Theorem \1.2[ We start with the case when Q is a standard orientable quadratic 
word. 

We assume that Q has at least one coefficient. If Q is coefficient-free then there exists a 
parametric solution tjq e Hom(Fvar(Q), Ft) of the equation Q = 1 such that the value of each 
variable is either a parameter letter or trivial and any ordinary solution a of Q = 1 in any 
coefficient group Fa may be represented as (pau where G Stab(Q) and u G Hom(Fr, Fa) 
(see Theorem 4 in Section 5 of a survey [12] )• This implies that r/o is a generalization of any 
other parametric parametric solution 77 of Q = 1 (since 77 is an ordinary solution in Faut)- 
In this case, the statement of Theorem 11.21 holds with bound Ix*"^! < 1. 

Let ?7 be a parametric solution of a standard quadratic equation Q = 1 where 

Q = [Xi,yi] [X2, 1/2] •• • [Xg, yg\ClZ2^C2Z2 ■ ■ ■ Z:;^CmZm 

Similarly to the proof of Theorem 11.11 we may assume that all Cj are cyclically reduced. We 
adjust Q and rj by introducing an extra variable Zi with zl = 1 and replacing ci in Q with 
z'^'^CiZi. By Proposition 16.21 with Q := ([xi.yi] . . . [xg,yg])~^ we find an equation of the form 

a finite set of variables V 3 Var(i?) and a parametric solution 9 : Fauvuz -^ Faut which 
satisfy conditions (i)-(iii) of that proposition. 

Let a primitive F^-monomorphism : -FAuVar(Qo)uz ~^ Fauvuz and u G Endp^^FAuT) be 
as in (i), that is, up to conjugation we have 

(4) Q ^ R~ 2;^^^^C^(i)2^(i) . . . Z^(^rn)Ca{m)Za{m) 

and the following diagram is commutative: 



AUVUZ 



(5) FAuVa.r{Qo)UZ 

V 

Faut ^ Faut 

Our strategy is to further transform the equation R = z~h-.Co-{i)Za-(i) ■ ■ ■ z~}.c„(^rn)Za{m) and 
its parametric solution 6 so that we come back to the initial equation Q = 1. After that, 
with a slight adjustment, 6 will give a desired generalization of 77. We will find an "ecomonic" 
transformation which, together with conditions (ii) and (iii) of Proposition 16. 2|, will provide 
the required bound on the size of the resulting generalization of 77. 

As in the proof of Proposition 16. 2[ each moment we have the following data: 

• A quadratic word R* of the form 

R* = R Z^(^)C^(i)2;<^(i) . . . Z^^^^C„^jn)Za{m) 

where R is a. coefficient-free quadratic word equivalent to [xi,yi] . . . [xg,yg]; 

• A parametric solution 9 : Fauvuz -^ Faut of the equation R* = 1. 

• A primitive F^-monomorphism : -FAuVar(Q)uz -^ Fauvuz and an endomorphism 
u G Endp^iFAuT) with (g]) and ([5]). 

Note that we do not include the set V of formal variables here since we do not need to 
introduce new variables and V will not change. 
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In a similar way, we will use two types of elementary transformations: substitutions and 
generalizations. During the transformation process, condition (i) of Proposition 16.21 will be 
automatically held by construction. 

We start now with the pair (i?*, 6) obtained after application of Proposition 16.21 Observe 
that conditions (ii) and (iii) imply the following bound on the total length of values x^ of 
variables in x G Var(^): 

a;eVar(_R) 

Step 1: Transforming the coefficient part. Let A be the diagram satisfying condition (iii) 
of Proposition 16.21 We unfold A using Proposition 15.41 and in a similar way as in the proof of 
Theorem 1 1.1 1 we change the coefficient part of equation R* = 1 and the parametric solution 6 
using Proposition 15.31 so that the equation gets the form 

-ti ^ Z^ C\Z\ . . . Zjj^ CmZm 

and we have 

\z-\ < 3c(Q) + 4m for alH = 1, 2, . . . , m. 

Step 2: Transforming R to the standard form. We apply the procedure described in 
the proof of Proposition l3.9l to make R a product of commutators. Since our transformation 
should apply to R we mimic application of related Nielsen automorphisms to R as application 
of automorphisms to R in the following way. 

Assume that x'^y^ occurs in R an we want to apply to ^ a related Nielsen automorphism 
p = (x^ — )■ x^y~^). There is a subword x^Wy^ of R where W is deleted in R. In particular, 
W^ = 1. Then application of an automorphism (x'^ — )■ x'^y~^W~^) to R produces a new 
word i?i such that -Ri = R^. 

At this step, we change the values of parametric solution 9 on variables of x G Var(^) 
only (and will not change them until the final Step 4). According to Lemma [3. 121 we get 

|x^| < 2c(g) + Am for all x G Var(^). 

Step 3: Transforming the deleted part to the standard form. Recall that variables in 
Var(i?) \ Var(^) are divided into two types: degenerate variables x with x^ = 1 and cancelled 
ones which cancel in R after removal of degenerate variables. 

Let 

R = Wox\'Wixl^ . . . Wk-ixl'Wk 

where R = x^^x^^ . . . x^*" and Wi are deleted in R. Using substitutions (xf — ;■ xfW~^) and 
i^i+i -^ W^r^^m ) ^^ ^^^ move Wj's not changing R. Since ^ is a product of commutators, 
it is easy to see that we can collect all Wi's together at any location between xf and x^^/. 
We reduce R to the form 

R = WR. 

Let x,y E VariW) and a = (x^ i-> x'^y^) G Aut{FAuvuz) be a Nielsen automorphism 
related to R (i.e. (x^y"^)^^ occurs in W). Application of a does not change R whenever x 
is cancelled or both x and y are degenerate. Observing that two cancelled variables cannot 
"cross" in W and using Nielsen automorphisms of the above form we transform W to a. 
product of commutators 

[yi,y2]--- [y2r~i,y2r] 

where at least one variable yj or j/j+i is degenerate in each commutator [|/j,?/j+i]. 
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Finally, we apply a generalization to 9 by assigning a single parameter letter to each 
cancelled variable x. After the transformation, all of R is written as a product of commutators 
and we get 

|x^| < 1 for all X e Var(i?) \ Var(i?) 

Step 4: Eliminating Zi . We do this in the same way as in the proof of Theorem 11.11 The 
resulting bound increases by Qc{Q) + 8m for \x^\ when x G Var(i?) and by ?>c{Q) + Am for 

We have "almost" produced the desired generalization 9 of the initial parametric solution r/, 
with the only difference that 9 is formally defined on a larger set of variables V . This issue 
is solved by the following observation. 

Lemma 6.3. Let Q G -Faux be a standard quadratic word with at least one coefficient. If 
• FAuVar(Q) — ^ F^yjv ^^ a primitive Fjx-monomorphism and Q'^ is conjugate to Q then 

Im = FAuVar(Q) ■ 

Proof. It follows from Definition 16.11 that a primitive FA-monomorphism : -FAuVar(Q) ~^ 
Fa\jv can be extended to an F^-automorphism of -Fauv- Assume that G Auti?^(FAuy) and 
Q'^ is conjugate to Q. We apply a slightly modified version of Proposition 1.4.24 in [21j to 
one cyclic word ui = Q and the tuple of non-cyclic words Mj, i > 2, consisting of all letters 
of A. Observe that there is no elementary Whitehead automorphism p G Aut(FAui/) which 
does not increase the length of all Ui and strictly decreases it for at least one i. Then can 
be represented as a product pip2 . . . pr oi elementary Whitehead automorphisms pi which do 
not change the length of each Ui. We can eliminate permutations and exponent sign changes 
of generators, so each pi can be assumed to be an elementary Whitehead F^-automorphism 
of Fj^uy. An easy inductive argument shows that then for each i, the image Qp^-P'- (viewed 
as a cyclic word) is a quadratic word equivalent to Q and (FAuvariQ))'^' = -FAuvar(Q)- This 

implies that (FAuVar(Q))'^ = -FAuVar(Q)- □ 

The lemma shows that as long as we have a commutative diagram ([5]) where G StaJo{Q) 
then then we can restrict it to FAuVar(Q): 

-FAuVar(Q) ^ -F4uVar(Q) 

V 

Favjt ^ Favjt 

This means precisely that 6' is a generalization of rj. The proof of Theorem 11.21 is finished in 
the case of a standard orientable quadratic equation Q = 1. 

The case when Q is semi-standard non-orientable is treated in a similar way with the 
following changes: 

• The word R* under transformation has the form 

n — ^ ^fT(l)^tT(l)^<7{l) • • • ^(T(m)'^CT(m)^<7M- 

• After Step 1, the equation gets the form 

-Tt Z^ C^ Z\ . . . Z^jj^ C,^ Z-ui 

for some £i, . . . , e^ = ±1- 

• In Step 2, we reduce Rto a. semi-standard form. 
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• In Step 3, we reduce W to a. semi-standard form 

yl---yk[yk+i,yk+2]--- [yr~i,yr] 

where each yi in yf is degenerate and either yi or ?/j+i is degenerate in each commu- 
tator [yi,yi^i]. After the reduction, we move W to the right of the last square factor 
in R so that the whole word R would be written in a semi-standard form. 

• Before Step 4, we apply Lemma [3.141 to get R* = Q. 

We prove the theorem in the general case. Let Q = 1 he any quadratic equation in Fa- 
Let G Auti;'^(FAuvar(Q)) be an automorphism given by Propositions 13.31 or 13.51 such 
that Q'f' is conjugate to a standard or semi-standard quadratic word R equivalent to Q. 
Then defines a one-two-one correspondence r] \-^ (prj between parametric solutions rj of the 
equation [R = 1, Var((5)) with formal set of variables Var(Q) and parametric solutions (prj 
of the equation Q = 1. 

Recall that a parametric solution rji : FAuya.r[E) ~^ Favjt of -E = 1 is a generalization of 
another parametric solution 772 of the same equation E = 1 ii there are an automorphism 
ip G Stab(i?) and an endomorphism u G Endi;'^(F^uT) such that 772 = iprjiu. We extend this 
definition to parametric solutions of equations {E = 1,V) with formal set of variables by 
taking instead of Stab(i?) the group 

Stab(E, V) = {i) e Autp^iFAuv) I E''' is conjugate to E}. 

Clearly, the correspondence rj ^ tprj preserves the relation "771 is a generalization of 772" in 
the new extended version. 

For a parametric solution 6 oi R = 1, let 6 denote the parametric solution of {R = 1, Var((5) 
defined by extending 6 in the following natural way: for each variable x G Vai{Q) \ Var(/?) 
we choose a parameter letter t (which does not occur in parametric words y^ for y G Var(i?)) 
and set x^ = t. Clearly, any parametric solution of [R = l,Var(Q)) has a generalization 
of the form 6 for some 6. It is also obvious that the correspondence 6 \-^ 6 preserves the 
relation "^1 is a generalization of 62" . 

Now let 7] be an arbitrary parametric solution oi Q = 1. To find the required general- 
ization rji of 7], we pass on to the parametric solution ip'^C, of {R = 1, Var((5)) and take its 
generalization of the form 6 for some 6. By the statement of the theorem for the equation 
R = 1, there is a generalization 61 of 6 such that for any x G Var(i?) 

g j 8{c{R) + 2n{R)) if Q is orientable, 

~ ]36n{R){c{R) + 2n{R)) if Q is non-orientable. 

We take 7/1 = ipOi. By the bounds on ip in Propositions 13.31 or 13.51 (and using inequalities 
n{R) < n{Q) and c{R) < c{Q)), for any x G Var((5) we have 



b^M 



<4 ^ \x''\+2c{Q) 

a;GVar(Q) 

f 34n(i?)(c(i?) + 2n{R)) if Q is orientable, 

j U6n{Rf{c{R) + 2n{R)) if Q is non-orientable. 



< 



as required. D 
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